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Preferably, the cells are intact However, cell lysates, homogenates or otherwise disrupted 
cells may be used as a substrate for the present invention and included in the definition of 
the term "cells". 

5 An abnormality may be a lesion, such as a pre-cancerous lesion resulting from a mucosal 
papilloma virus infection in an organism, or may be abnormally proliferating cells 
whether malignant or otherwise. 

The term 'molecule' is used to refer to any molecule having the desired binding properties 
10 referred to above, and in more detail below. Advantageously, the molecule may be an 
antibody, or an antigen binding fragment thereof. 

The binding of the molecule may be assessed by any suitable means, such as by use of a 
fluorescent label, a radioactive label, an enzymatic label, or any other label, or may be 
15 assessed by any other suitable method(s) such as surface plasmon resonance. Preferably, 
binding of the molecule is assessed by fluorescent labelling. 

A "sample" means a suitable quantity of cervical material, in particular cervical cells. 
The sample could take the form of a biopsy, cells from a swab, a smear or other means of 
20 obtaining cervical cells. Preferably, the sample comprises cervical cells. More 
preferably, the sample comprises cells collected from a cervical smear. 

"Simultaneously" means substantially at the same time. In this context, it may mean 
within the same experimental testing procedure, which may itself span many hours. Two 
25 tests will be considered to be performed simultaneously if they are carried out as part of 
one continuous string of operations carried out on the whole or part of the same sample. 
It is a preferred embodiment of the present invention that the several tests described are 
performed on the sample simultaneously. 

30 Molecules binding to human papilloma virus polynucleotides or polypeptides may also 
bind to, or have been raised against, other papilloma virus polynucleotides or 
polypeptides. The papilloma virus referred to herein may include other papilloma type 



WO 02/08764 




PCT/GB01/01176 



6 



viruses. Preferably, the virus is human papilloma virus. More preferably, the virus is 
human papilloma virus and comprises one or more types selected from the group 
consisting of HPV types 16, 18, 33, 35, 45, 51, 52, 56, 58 and 61. Molecules according to 
the invention may bind to numerous viral polynucleotides or polypeptides. 

Advantageously, the molecules according to the invention bind to HPV polypeptides. 
HPV DNA may be found in a proportion of cervical smears with no apparent viral 
infection or pathology. Hence, the use of molecules binding to HPV polypeptides 
provides a superior assay to tests for HPV nucleic acid. Preferably, at least one molecule 
is a molecule that binds specifically to a subset of HPV E4 proteins. More preferably at 
least one molecule is a molecule capable of binding to the papilloma virus-E4 protein, and 
is capable of binding within a hydrophilic region of the E4 sequence. 

The invention preferably involves the detection of HPV proteins and markers of cell 
proliferation, or proteins that are expressed in the cell as a result of the presence of viral 
gene products such as pl6 (see Sano, T. et al (1998) Am. J. Pathol. 153, 1741-1748; Sano, 
T., et al (1998) Pathol. Int. 48, 580-585; Lukas, J., et al (1994) J Cell Biol 125, 625-638.). 
Molecules binding to cell proliferation markers may advantageously be used in the 
methods of the present invention. Cell proliferation markers are gene products which are 
expressed in actively dividing cells, or cells which are committed to. or are entering the 
cell cycle. These markers are generally absent from cells which are quiescent, dormant, in 
stationary phase or otherwise arrested either temporarily or permanently and are not 
participating in the cell cycle. Preferably cell proliferation markers according to the 
present invention comprise one or more polypeptides which are members of a 
preinitiation complex of DNA replication. More preferably the proliferation marker 
comprises one or more polypeptides selected from the group consisting of CDC6, MCM2, 
MCM3, MCM4, MCM5, MCM6, MCM7, Cdc7 protein kinase, Dbf4, Cdcl4 protein 
phosphatase, cyclin A, Cdc45, PCNA, Ki67, KiSl, and MCM10. Cell proliferation 
markers may be detected at the mRNA stage. Preferably cell proliferation markers are 
detected as polypeptides. 
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In a preferred aspect, the cell proliferation marker is not an MCM molecule. 
Advantageously, it is PCNA, Ki67 and cyclin A. 



The cell proliferation marker may moreover be replaced with a marker for viral activity, 
5 such as cyclin B or pi 6. 

Molecules suitable for use in the methods of the present invention may bind to more than 
one target polypeptide. The invention accordingly provides a method for detecting 
abnormalities in a sample from a patient, said method comprising; obtaining a sample of 
10 the patient's cells; contacting said cells with two or more molecule(s), wherein at least 
one molecule is capable of binding a papilloma virus associated nucleic acid or antigen or 
a plurality of papilloma virus associated nucleic acid or antigens, and . at least one 
molecule is capable of binding a cell proliferation marker or a plurality of cell 
proliferation markers, and monitoring said binding. 

15 

Reagents described and referred to herein may - be advantageously supplied with 
instructions for the working of the invention in kit form. The present invention 
accordingly provides a kit comprising two or more molecule(s), wherein at least one 
molecule is capable of binding a papilloma virus associated nucleic acid or antigen, and at 
20 least one molecule is capable of binding a cell proliferation marker, and instructions for 
use of said molecules in a method described herein. 

The invention moreover provides a molecule is capable of binding a papilloma virus 
associated nucleic acid or antigen, and a molecule is capable of binding a cell 
25 proliferation marker, for simultaneous, simultaneous separate or sequential use in the 
detection of abnormalities in a sample. 

The ability of the invention to assess two separate indicators of cellular abnormality 
which can lead to cervical tumours permits the categorisation of lesions according to the 
30 severity of the risk posed to the patient, and therefore allow a clinician to prioritise urgent 
treatment in more serious cases. 
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A pattern of events necessary for completion of the virus life cycle is presented herein. In 
the lower layers of all productive infections including those caused by HPV1, 2 and 1 1, 
staining for E2F-activated genes such as PCNA or MCM is widespread. The extent to 
which these cells extend into the upper layers varies depending on the infecting HPV type, 
5 but in productive infection such proteins are never found in the surface layers. E4 
expression begins in these replication competent cells as might have been predicted from 
previous studies showing that E4 expression coincides with the onset of genome 
amplification (Doorbar, J., Foo, C, Coleman, N., Medcalf, E., Hartley, 0., Prospero, T., 
Napthine, S., Sterling, J., Winter, G. & Griffin, H. (1997) Virology 238, 40-52). Results 

10 obtained with different HPV types confirm the close association between vegetative viral 
DNA replication and E4 expression. The eventual loss of PCNA and MCM staining in 
these E4-positive cells marks completion of viral DNA replication and that packaging of 
the HPV genomes into infectious virions can occur. The molecular events which regulate 
this final stage of the virus life cycle are poorly understood, but it is likely that packaging 

15 is linked to the completion of genome amplification as it is in other DNA viruses (Wiley, 
S. R., Kraus, R. I, Zuo, F. G., Murray, E. E., Loritz, K. & Mertz, J. E. (1993) Genes Dev. 
7, 2206-2219). The most striking observation to arise from the present work is that every 
cell supporting productive infection will expresses one or more of these proteins. The 
particular combination of proteins present in any particular cell identifies a particular 

20 stage in the virus life cycle. 

The present invention , demonstrates that the histological status of the cervix can be 
predicted from the expression pattern of just three proteins (E4 A L1 and a marker of 
proliferation or viral infection) in cells in the upper epithelial layers. The different 
25 combinations of these three markers in cells in the surface layers of the epithelium are 
predictive of disease grade. The risk associated with cellular abnormalities is stratified as 
follows: 

A. High risk- immediate treatment required. Positive for cellular proliferation markers; 
30 positive or negative for HPV infection. Indicative of a high grade lesion (Cancer, HSIL, 



CIN3 orCIN2) 
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B. Intermediate risk; early follow-up required. Positive for high-risk HPV type, but 
proliferation marker negative. Indicative of a low grade lesion (condyloma, LSIL or 
CIN1) caused by intermediate risk virus. 

C. Lower risk; early follow-up recommended. Positive for intermediate risk HPV, but 
proliferation marker negative. Indicative of a low grade lesion (condyloma, LSIL or 
CEM1) caused by low risk virus. 



D. Low risk; standard follow-up. Negative or low risk HPV; proliferation marker 
negative. 

The invention accordingly further provides a method for assessing the risk associated with 
15 cellular abnormality in a patient sample, comprising obtaining a sample of the patient's 
cells; contacting said cells with two or more molecule(s), wherein at least one molecule is 
capable of binding a papilloma virus associated nucleic acid or antigen, and at least one 
molecule is capable of binding a cell proliferation marker or a plurality of cell 
proliferation markers and/or marker(s) of viral infection; and categorising the risk 
20 according to (a) the presence of cell proliferation markers, and (b) the detection of high or 
low risk HPV virus infection. The use of E4 as a marker is highly preferred over the use 
of nucleic acid, as it allows active virus infection to be examined whereas the use of DNA 
probes does not distinguish between active virus infection and latent infection. HPV 
DNA can be found in around 30% of women with normal cervical morphology and thus 
25 the presence of DNA does not provide an indication of disease status. E4 expression is 
inversely correlated with the severity of cervical disease. E4 is never expressed in tissue 
with normal histology. 

Brief Description of the Drawings 



10 



30 



Figure 1A shows the amino acid sequence of HPV 16 E4 protein and the binding sites of 
various antibody molecules or E4-specific antigen-binding fragments of antibodies; 




10 
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Figure IB shows the sequence of the E4 protein from HPV16 (top row), HPV1 (bottom 
row) and a consensus sequence (middle row), and the binding sites of various antibodies 
or antigen-binding variants of antibodies; 

Figures 2A-2D show four sensograms (arbitrary response units against time in seconds) 
obtained using surface plasmon resonance apparatus; 

Figures 3-8 are micrographs showing variously stained samples, as explained in the text; 
Figure 9 is an amino acid sequence alignment of part of HPV E4 proteins; 



Figure 10 shows the distribution of E4 and MCM protein in a range of HPV 16 induced 
HSIL. Panel (A): HSIL low (464); Low power image; brown = MCM; red = E4. Panel 

15 (B): HSIL (464); brown = MCM; red = 16 E4. Panel (C): HSIL (11431); red = MCM; 
brown = E4. Panel (D); HSIL (11433): brown = MCM; red = 16 E4. Panel (E): HSIL 
(15919); Brown = MCM; red = 16 E4. In all images the nuclear signal indicates the 
presence of cellular MCM protein. Expression of MCM is induced by E2F and is absent 
from the surface layers of histologically normal epithelium as shown in other figures in 

20 this patent. MCM staining is abundant in HSIL due to the extensive expression of E7. 
The cytoplasmic signal present in the surface layers of some HSILs shown (464, 11433, 
15919) indicates the presence of the viral E4 protein. Typically only a few cells at the 
epithelial surface are positive indicating that stimulation of the late stage of the virus life 
cycle is incomplete. 

25 

Figure 11 shows the distribution of E4 protein and MCM in a range of HPV 16 induced 
LSIL. Panel (A): LSIL-low (2565) Low power image; red = MCM; brown = E4. Panel 
(B): LSIL (2565); red = MCM; brown = E4. Panel (C): LSIL ( 4165); red - MCM; brown 
« E4. Panel (D): LSIL (10451); red = MCM; brown - E4. Panel (E): LSIL (5388); red = 
30 MCM; brown = E4. The cytoplasmic staining in the upper epithelial layers is E4. The 
nuclear staining which is most abundant in the layers below those expressing E4, shows 
the distribution of MCM. The nuclear staining pattern for MCM extends above the basal 
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layer but does not reach the surface. E4 expression is much more abundant in the surface 
layers of LSIL than MCM. 

Figure 12 shows double staining for junction region E4/MCM in the surface layers of 
5 normal cervical epithelium. Panel A: LSIL edge; red = MCM; brown = E4. Panel B: 
HSIL edge; red=E4; brown = MCM. Cytoplasmic staining in the surface layers is due to 
the presence of E4. Nuclear staining beneath this shows the distribution of MCM 
proteins. The two images show the junction between a' LSIL and normal tissue or 
between a HSIL and normal tissue. The cytoplasmic E4 staining is absent in the normal 
10 tissue (to the right of each image) and the nuclear MCM signal is confined to cells in or 
close to the basal layer. 

Figure 13 shows combining staining for E4 in HPV1, HPV2 and HPV16 with MCM in 
immunofluorescence images. Panel A: HPV1 wart; red = MCM5; green = E4; blue = 

15 DAPI nuclear counterstain. Panel B: HPV2 wart; red = MCM5; green = E4; blue = DAPI 
nuclear counterstain. Panel C: HPV 16 LSIL; red = MCM5; green = E4; blue = DAPI 
nuclear counterstain. E4 expression always follows MCM expression on low-grade 
lesions caused by other HPV types. Lesions caused by HPVl(wart), HPV2 (wart) and 
HPV16 (LSIL) are shown. In all cases, nuclear MCM staining is present in the lower 

20 epithelial layers whereas cytoplasmic E4 staining is found to begin in these cells as they 
migrate towards the epithelial surface. Cells positive for both E4 and MCM are present 
at the interface. As E4 expression coincides with the onset of genome amplification, this 
is likely to be a requirement for successful amplification of the viral genome. 

25 Figure 14 shows double-staining experiments which demonstrate that E2F-induced 
proteins can be substituted for MCM. Panel A: PCNA/E4 LSIL; Panel B: PCNA/E4 LSIL 
high power, Panel C: PCNA/E4 HSIL; Panel D: PCNA/E4 HSIL high power. All images 
red = PCNA, green = E4, blue = DAPI. Panel E: cyclin A/E4 LSIL; Panel F: cyclin A/E4 
LSIL high power; Panel G: cyclin A/E4 HSIL; Panel H: cyclin A/E4 HSIL high power; 

30 Panel I: cyclin A/E4 HSIL 2; Panel J:cyclinA/E4 HSIL high power 2; All images, red = 
cyclin A, green = E4, blue = DAPI. The nuclear protein PCNA is activated by E2F and 
shows a similar expression pattern to MCM. Its expression (nuclear staining) is most 
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extensive in the epithelial cell layers below those in which E4 expression (cytoplasmic 
staining) is abundant. Expression of PCNA in the surface layers of HSIL is more abundant 
than in the surface layers of LSIL. In HSIL, In HSIL, E4 expression is restricted to small 
pockets of cells which show some evidence of morphological differentiation. 

5 

Cyclin A is also activated by E2F. In cervical neoplasia caused by HPV16 it has a 
cytoplasmic and nuclear distribution and is found in sporadic cells in the basal epithelial 
layers and above. E4 (cytoplasmic staining) is detectable in the cytoplasm of cells in the 
layers above those that are expressing cyclin A. In HSIL, cyclin A expression extends 
10 into the surface epithelial layers, where as in LSIL, cyclin A rarely extends above the 
intermediate layer. As shown for MCM and PCNA, E4 expression in the surface layers is 
most abundant in regions where cyclin A expression is absent from the surface. 

Figure 15 is a further double-staining showing that other markers of cell proliferation can 
15 be substituted for E2Finduced genes. Panel A: CyclinB/E4 LSIL. Panel B: cyclin B/E4 
LSIL high power. Panel C: cyclinB/E4 HSIL. All images: red = cyclinB,- green = E4, 
blue = DAPI. Both cyclin B and E4 show cytoplasmic staining patterns. In LSIL, cyclin 
B is found in the intermediate cell layers and below. E4 is expressed in sporadic cells in 
the intermediate layer and in layers above this. In HSEL, cytoplasmic cyclin B and E4 can 
20 both be found in cells near to the epithelial surface. 



Detailed Description of the Invention 

25 THE CERVIX 

The cervical epithelium is essentially composed of two distinct cell types; the squamous 
epithelium and the columnar epithelium, each of which is located in an anatomically 
distinct region of the tissue. The squamous epithelium is located at the exterior aspect 
30 (the ectocervix) of the cervical opening (os), while the columnar epithelium extends into 
the endocervical canal (the endocervix). These two distinct epithelial cell types come into 
contact in the vicinity of the cervical os, the squamo-columnar junction. The squamo- 
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columnar junction is of clinical importance as it is the region where the majority of 
malignancies arise. For diagnostic validity, a cervical smear sample should include cells 
from this region. In order to ensure that this has been achieved, a smear should contain 
columnar as well as squamous epithelial cells. 

5 

Most cervical tumours arise at the squamo-columnar junction from the squamous 
epithelium, which is multilayered dynamic stem cell system under constant renewal. The 
stem cell compartment itself is located adjacent to the basement membrane within the 
basal cell layer. Stem cell division gives rise to parabasal, intermediate, and superficial 
10 cell derivatives. These are conventionally defined in terms of both their characteristic 
morphology and location within the squamous epithelium. The transition from basal cells 
located in the deepest layer of the squamous epithelium, to superficial cells at its surface 
is associated with progressive differentiation and a loss of proliferation until superficial 
squamous epithelial cells at the cervical surface are terminally differentiated. 

15 

In dysplasia, there is increased cellular proliferation with a reduction in differentiation of 
cells as they progress through the squamous epithelium. Typically, for convenience in the 
first instance, cervical screening involves assessment of smears taken from the surface of 
the epithelium, looking for abnormalities at the surface representative of reduced 
20 differentiation as a result of dysplasia. 

At the late foetal stage, during adolescence and in pregnancy columnar epithelium is 
replaced at the junction by squamous epithelium by a process of metaplasia. Metaplastic 
squamous cells which replace columnar cells are particularly vulnerable to carcinogens. 

25 Normal metaplasia should not be confused with abnormal dysplasia within the squamous 
epithelium. The majority of cases of cervical malignancies represent squamous cell 
carcinoma (SCC) and are strongly associated with infection with 'high-risk- types of 
human papillomavirus, such as 16, 18 and 31 (Park et al Cancer, 1995, 76 (10 Suppl.) : p. 
1902-13). Cervical carcinoma is amenable to prevention by population screening, as it 

30 evolves through well-defined non-invasive 'intraepithelial' stages (Wright, et al. 
Precancerous lesions of the cervix, in Blaustein's pathology of the female genital tract 
RJ. Kurman, Editor. 1994, Springer-Verlag: New York. p. 229-78. Squamous 
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intraepithelial abnormalities may be classified using 3 tier (ON) or 2 tier (Bethesda) 
systems. Different histological abnormalities broadly correlate with the type of infecting 
HPV and with the DNA ploidy, clonality and natural history of the lesion. As classified 
by the Bethesda system, low grade squamous intra-epithelial lesions (LSIL), 

5 corresponding to CIN1 and cervical HPV infection (HPVI) generally represent productive 
HPV infections, with a relatively low risk of progression to invasive disease (Wright and 
Kurman. A critical review of the morphological classification systems of preinvasive 
lesions of the cervix: the scientific basis for shifting the paradigm, in Papillomavirus 
reviews: current research on papillomaviruses, C Lacey, Editor. 1996, Leeds University 

10 Press: Leeds). High grade squamous intra-epithelial lesions (HSIL), corresponding to 
CIN2 and CIN3, show a higher risk of progression than CIN1 (LSIL) though both axe 
'viewed as representing potential precursor of malignancy. Although it is possible to 
estimate the approximate risk of malignancy for each category of intra-epithelial lesion, it 
is currently not possible to determine the approximate likelihood of progression for an 

15 individual case. 

The major reasons why cytological screening sometimes fails to detect cervical cancer are 
the large intervals between tests and also the high number of false negative results (10- 
30%) (Pap Cytology screening: Most of the benefits reaped? WHO and EUROGIN 
20 release a report on cervical cancer control Press Release WHO/25, March 1997). 

The high number of false negative results reflects the fact that interpretation of Pap 
smears is one of the most difficult of morphological exercises. The results of a Pap smear 
are harder to interpret than those of fine needle aspiration, body fluid cytological testing 

25 or biopsies because of the complexity and variability of the mixed cell population placed 
on the smear and the wide range of inflammatory and reparative processes that occur in 
the cervix. There are also cyclical changes in the cellular population, pregnancy induced 
alterations and alterations that occur in the postmenopausal period. Because 
gynaecological cytology is so difficult, the training periods for cytotechnologists are long; 

30 they require an educated student and high degree of discipline and pattern recognition 
skills. Even after completing an adequate training programme, cytotechnologists require 
several years of practical experience before they can make consistently accurate 
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judgements as to whether a Pap smear result is normal or abnormal. Similarly, although 
pathologists may be trained to interpret histological sections, they require specialised 
additional training in cytopathology to possess adequate skills to organise and supervise 
the cytology laboratory and to make appropriate diagnoses concerning abnormal smears. 



The two major problems associated with current screening programmes are an apparently 
unavoidable false negative rate (10-30%) and the relatively high cost of screening. 
Therefore, alternative approaches are now being considered to cervical screening. The 
two most commonly discussed proposals are to use HPV DNA testing and typing as a 
10 primary screening modality or as a supplement to Pap smears and to use instruments that 
can automatically screen conventionally taken Pap smears, thus reducing the need for the 
relatively highly paid cytotechnologists and cytopathologists (Richart, Cancer 



Supplement, 1995, 76(10): 1919-1927; Birdsong, Human Pathology, 1996, 27(5): 468- 



A number of companies are currently developing and marketing automated screening 
instruments. In general such instruments use a high resolution video scanner to capture 
images, which are then digitised and analysed with a series of algorithms, and the data are 
then passed through an interference network through which the machine has been trained 

20 to distinguish between normal and abnormal cellular components. It is hoped that with 
further software and hardware development, automated screening can be considered for 
primary screening, though at the moment no devices have been approved for the pre- 
screening or independent screening of Pap smears by the US FDA. That companies are 
prepared to invest so heavily in such an expensive and complex approach in attempting to 

25 overcome problems with conventional PAP smear testing illustrates the severity of the 
problems and the long-felt need for a solution. 



5 



481). 



15 



CELL PROLIFERATION MARKERS 



30 Measuring parameters of cell proliferation can provide objective information about 
tumours, for example using cell proliferation markers such as PCNA, Ki67 etc. The most 
widely studied markers of proliferation are Ki67 and PCNA (proliferating cell nuclear 
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antigen) (Yu and Filipe, Histochemcial Journal, 1993, 25: 843-853). PCNA is involved in 
the elongation of DNA replication and in the mechanism of DN A repair. Therefore it is 
present during actual DNA synthesis by replication or repair. 

5 There are also useful markers involved in the earlier initiation stage of DNA replication. 
These include Cdc6 and proteins of the MCM2-7 family (MCM2, MCM3, MCM4, 
MCM5, MCM6 and MCM7). Williams et al (1997) (Proc. Natl. Acad. Sci. USA, 1997, 
94: 142-147) reported that human HeLa Cells in culture express Cdc6 throughout 
proliferating cell cycles, but that WI38 human diploid fibroblasts stop expressing Cdc6 

10 when made quiescent by serum starvation. It is shown herein that these observations 
extend to other cell lines and other species. MCMs are present in Gl phase nuclei (before 
DNA synthesis) and are progressively displaced from chromatin into the soluble 
nucleoplasm during DNA synthesis. It is shown that they are absent from chromatin 
during quiescence. It is also known that MCM5 is absent from differentiated cells of the 

15 uterine cervix and breast. Cdc6 antibodies or MCM (e.g. MCMS) antibodies are known 
to detect LSIL (HPVI/CIN 1) lesions of the cervix. Furthermore, essentially all cells of 
LSIL (HPVI/CIN 1) or HSIL (CIN 2/3) lesions are stained. This indicates that specific 
binding molecules directed to proteins of the preinitiation complex of DNA replication, 
particularly Cdc6 or MCM proteins (such as MCM5, MCM2, MCM3, MCM4, MCM6 

20 and MCM7) are useful for detection of atypical or neoplastic cells. 

Useful cellular proliferation markers may include human homologues of yeast 
components, such as Cdc7 protein kinase (Chapman and Johnston, Exp. Cell Res., 1989, 
180 419-428 (yeast), Sao et al 1997, EMBO J., 16, 4340-4351 (human - down-regulated 

25 in quiescence)), Dbf4, the regulatory subunit of Cdc7 protein kinase (Jackson et al, 1993, 
Mol Cell Biol 13 2899-2908 (yeast), Masai et al t Cold Spring Harbor Meeting on 
Eukaryotic DNA Replication, 3-7 September 1997 (human)), Cdcl4 protein phosphatase 
(Hogan and Koshland PNAS USA, 1992, 89, 3098-3102 (yeast)), Cdc45, which associates 
with and has a similar phenotype to MCMs (Zou et al, Mol Cell Biol, 1997, 17, 553-563 

30 (yeast), Takisawa et al, Cold Spring Harbor Meeting On Eukaryotic DNA Replication , 
3-7 September 1997 (Xenopus)), MCM10, which associates with and has a similar 
phenotype to MCMs (Merchant et al, 1997, Mol Cell Biol 17 3261-3271) as well as 
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replication factors such as ORC2. Target polypeptides of the present invention may 
variously be said to be any of components of the DNA pre-replicative complex, 
components of replication competent chromatin, involved in restricting DNA replication 
to once per cell cycle, components of the replication licence, involved in licensing 
chromatin for a single round of DNA replication, and assembled at replication origins 
before initiation of DNA replication. 

Moreover, both cyclin A and the viral activity marker cyclin B are useful markers in the 
invention, as is pi 6. 

The sequence of human cyclin B mRNA is disclosed in Jackman et al, EMBO J. 14 (8), 
1646-1654 (1995); see GenBank Acc. No. NM_004701.2 GL10938017. 



The human cyclin A mRNA sequence is available on GenBank accession no. 
15 XM 003325.1 GI:11446079. 



The sequence of human pi 6 is available on GenBank accession no. XM_005656.1 
GI: 11429662. 



20 Human Cdc6 amino acid sequence is disclosed in Williams et al t 1997, PNAS USA 94: 
142-147, GenBank Acc. No. U77949. 

Human MCM2 sequence is disclosed in Todorov et al, 1994,X Cell Sci., 107, 253-265, 
GenBank Acc. No. X67334. 

25 

Human MCM3 sequence is disclosed in Thommes et al, 1992, Nucl Acid Res., 20, 1069- 
1 074, GenBank Acc. No. P25205. 

Human MCM4 sequence is disclosed in Ishimi et al, 1996, J. Biol Chem., 271, 24115- 
30 24122, GenBank Acc. No. X74794. 
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Human MCM5 sequence is disclosed in Hu et al, 1993, Nucleic Acids Res., 21, 5289- 
5293, GenBank Acc. No. X74795. 

Human MCM6 sequence is disclosed in Holthoff et al, 1996, Genomics, 37, 131-134, 
5 GenBank Acc. No. X67334. 

Human MCM7 sequence is disclosed in Hu et al t 1993, Nucleic Acids Res., 21, 5289- 



10 Human Cdc6 has been cloned by Williams et al whose paper {PNAS USA 94: 142-147, 
1997) provides the full amino acid sequence. Anti-Cdc6 binding molecules are very 



preferably smears. This compares with no binding to normal cervical tissue in a smear 



The amino acid sequence for human MCM5 is disclosed in Hu et al, 1993, Nucleic Acids 
Res., 21, 5289-5293, GenBank Acc. No X74795. Binding molecules directed against it, 
like Cdc6, are very effective in marking abnormality in various tissues, especially cervical 
samples, preferably smears. 



Binding molecules directed against MCM2, against MCM3, against MCM4, against 
MCM6 or against MCM7 are also effective in marking abnormality in tissue samples such 
as cervical smears. 

25 Thus, binding of (e.g.) an anti-Cdc6 or anti-MCM specific binding member to a sample 
provides for categorising the tissue from which the sample is derived as abnormal, 
potentially or actually pre-cancerous, dysplastic or neoplastic. In accordance with present 
practice upon obtaining a positive result using the Pap test, a positively-testing individual 
may be investigated further, for instance by means of biopsy testing and/or repeat 

30 screening. It is quite common for pre-cancerous potential not to result in an actually 
cancerous state. Six-monthly screening is typically used to follow progression or 
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effective in marking abnormality in various tissues, especially cervical samples, 



sample. 
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regression of dysplasia to allow for appropriate and timely therapeutic intervention if 
required. 

The present invention may be used to pre-screen samples before further analysis. The 
5 present invention may be used for screening or analysis of samples previously tested using 
an available technique, such as a Pap smear test or ThinPrep 2000 test Thus, a cervical 
smear for example may be tested using both the conventional Pap smear test and a test in 
accordance with the present invention. 

10 In a further aspect the present invention provides a method for determining, assessing or 
diagnosing the presence or absence of abnormal cellular proliferation, cellular growth 
abnormality, dysplasia, neoplasia, or a potentially or actually pre-cancerous or cancerous 
state in a tissue or sample thereof. 

15 KITS 

Binding molecules may be provided in a kit, which may include instructions for use in 
accordance with the present invention. Such kits are provided as a further aspect of the 
present invention. One or more other reagents may be included , such as labelling 

20 molecules, and so (see below). Reagents may be provided within containers which 
protect them from the external environment, such as a sealed vial. A kit may include one 
or more articles for providing the test sample itself depending on the tissue of interest, e.g. 
a spatula for taking a cervical smear, (such components generally being sterile). A kit 
may include any, any combination of, or all of; a blocking agent to decrease non-specific 

25 staining, a storage buffer for preserving binding molecule activity during storage, staining 
buffer and/or washing buffer to be used during antibody staining, a positive control, a 
negative control The design and use of controls is standard and well within the routine 
capabilities of those of ordinary skill in the art. 
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SAMPLES 



Samples may be removed from the body using any convenient means and technique. For 
cervical screening, standard smear samples may be employed. Alternatively, the 
5 ThinPrep 2000 technology (Cytec Corp, Boxborough, Mass., USA) may be used. This 
has been cleared by the US FDA as a replacement for the conventional method of Pap 
smear preparation. A sample is collected into a liquid medium instead of smearing the 
cells onto a glass slide. An automated processor (the ThinPrep 2000 machine) is later 
used to collect the cells from the liquid and deposit them in a thin layer on a glass slide for 
10 analysis. A spatula or swab may be used to remove endothelium cells from the cervix. 



The binding of molecules such as an antibody to samples may be determined by any 
appropriate means. Tagging with individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate detectable, and preferably 
15 measurable, signals. The linkage of reporter molecules may be directly or indirectly, 
covalently, e.g. via a peptide bond or non-covalently. Linkage via a peptide bond may be 
as a result of recombinant expression of a gene fusion encoding binding molecule (e.g. 
antibody) and reporter molecule. 

20 One favoured mode is by covalent linkage of each binding member with an individual 
fluorochrome, phosphor or laser dye with spectrally isolated absorption or emission 
characteristics. Suitable Fluorochrome include fluorescein, rhodamine, phycoerythrin and 
Texas Red. Suitable chromogenic dyes include diaminobenzidine. 

25 Other reporters include macromolecular colloidal particles or particulate material such as 
latex beads that are coloured, magnetic or paramagnetic, and biologically or chemically 
active agents that can directly or indirectly cause detectable signals to be visually 
observed, electronically detected or otherwise recorded. These molecules may be 
enzymes which catalyse reactions that develop or change colours or cause changes in 

30 electrical properties, for example. They may be molecularly excitable, such that 
electronic transitions between energy states result in characteristic spectral absorptions or 
emissions. They may include chemical entities used in conjunction with biosensors. 
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Biotin/avidin or biotin/streptavidin and alkaline phosphatase detection systems may be 
employed. Further examples are horseradish peroxidase and chemiluminescence. 

Those skilled in the art are able to choose a suitable mode of determining binding 
5 according to their preference and general knowledge. For example, horseradish 
peroxidase or alkaline phosphatase, may be used. 

PAPILLOMAVIRUSES 



10 Studies on nataally-occurring warts have revealed the virus to encode three late proteins - 
LI and L2, which are virion coat proteins (Doorbar ei al, 1987), and E1 A E4, a 
non-structural late protein of unknown function (Doorbar et al, 1986). In HPVl-induced 
warts the E1 A E4 protein is first expressed in cells of the lower spinous layer, and 
assembles into distinctive cytoplasmic and nuclear inclusions. During terminal 

15 differentiation it is post-transcriptionally modified by phosphorylation (Grand et al, 1989) 
and by removal of sequences from the N-tenninus (Doorbar et al, 1988; Roberts et al, 
1994). The E1 A E4 proteins of high risk viruses have been characterised, for example in 
W098/25145. Of these viral proteins, E1 A E4 is found to be much more abundant than 
either LI or L2 where it has been estimated (Doorbar et al 1987). Moreover, E1 A E4 

20 precedes both LI and L2 in its expression and its expression is preserved into high grade 
lesions even when LI is lost. 

The sample of patient's cells may comprise skin cells (e.g. in the case of warts, veruccas 
and the like, caused by cutaneous HPV infections). Cutaneous lesions, such as those 

25 induced by HPV types 5, 8, 14, 17, 20, are difficult to manage clinically, and are often 
associated \yith malignancies in immunosuppressed patients (Benton et al, 1992 
Papillomavirus Reports 3, 23-26). Alternatively, the sample may comprise mucosal cells, 
especially cervical cells, in the case of HPV infections of the urinogenital tract. Methods 
of obtaining and preparing such samples for use in the method of the invention are known 

30 to those skilled in the art or will be apparent from the present disclosure. 
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The term "pre-cancerous cervical lesions" is intended to refer to those abnormalities 
which clinically may be described as "pre-malignant" conditions and which may, without 
treatment, proceed to full malignancies. As set forth above, such lesions are screened for 
routinely by, for example, cervical smear testing. The present invention allows for cells 
5 obtained from patients by methods such as cervical smears to be tested more accurately 
and more quickly for HPV infection, simultaneously with testing for the later stages of 
malignancy. 

SCREENING FOR NUCLEIC ACIDS 

10 

Methods for screening cells for nucleic acids derived from HPV are well known in the art. 
For example, a test known as "Hybrid Capture" is available from Digene 
(www.digene.com) for the detection of a number of organisms, including HPV. Current 
"Hybrid Capture 2" technology is described in literature available from Digene. Hybrid 
15 capture technology relies on RNA probes which hybridise to target DNA present in 
infected cells. The DNA/RNA hybrids are detected using alkaline phosphatase 
conjugated antibodies and a chemiluminescent substrate. 

Nucleic acid hybridisation using DNA or RNA probes is generally applicable to the 
20 detection of HPV. Alternative systems may rely, for example, on PCR of HPV nucleic 
acids using appropriate primers. 

ANTIBODIES 

25 

Preferably, a molecule according to the invention -protein comprises an antibody molecule 
or an antigen-binding variant thereof, such as a Fab, Fv, scFv, "diabody" and the like. 
The molecule may comprise monoclonal or polyclonal antibodies, or antigen-binding 
portions of antibodies selected from libraries by screening (e.g. using- phage display 
30 technology). Alternatively the molecule may be some other polypeptide, peptide, a 
synthetic compound or an RNA or DNA aptamer, generated by a procedure such as 
SELEX. In some preferred embodiments the molecule comprises a label moiety, such as 
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a fluorophore, chromophore, enzyme or radio-label, so as to facilitate monitoring of 
binding of the molecule to E4 protein. Such labels are well-known to those skilled in the 
art and include, for example, fluorescein isothiocyanate (FITC), P-galactosidase, 
horseradish peroxidase, streptavidin, biotin, 35 S or I25 L Other examples will be apparent 
to those skilled in the art. The label may in some instances be conjugated to the antibody 
or antigen-binding variant, or may be present (where the label is a peptide or polypeptide) 
as a fusion protein. 

Preferably at least some of the molecules used in the method of the invention bind 
selectively to the E4 protein of a certain HPV type or types, but not to the E4 protein of 
other HPV types. Accordingly, in one embodiment the invention can be used to 
determine the type or types of HPV infecting a patient. This is very significant, as 
progression to malignant disease (and hence clinical prognosis) is heavily dependent on 
HPV type. Accordingly, the invention provides a method of determining the type(s) of 
HPV infection in a patient, the method comprising the steps of: obtaining a sample of the 
patient's cells from the site of HPV infection; contacting the cells with a molecule that 
binds specifically to a subject of HPV E4 proteins; and monitoring said binding. 
Preferably, said molecules bind type-specifically to the E4 polypeptide, allowing the 
identification of high-risk HPV types. 

In a further aspect the invention teaches the use of an antibody molecule, or an antigen- 
binding variant thereof, which binds specifically to HPV E4 protein in the region of 
amino acid residues RPIPKPSPWAPKKHRRLSSDQDSQTP of HPV16 E4 protein, or 
the corresponding hydrophilic, acidftase-rich region of other HPV E4 proteins. 

In a method according to the invention, the subset of E4 proteins to which at least one 
molecule binds may consist of a single HPV type E4 protein, or may consist of a plurality 

of E4 proteins of different types. If the E4 proteins to which the molecule binds do not 

- 

encompass the E4 proteins of all known HPV types, then binding or non-binding (as 
appropriate) of the molecule to the E4 protein present in the cell sample may allow an 
investigator to make certain deductions about the identity of the HPV type(s) infecting the 
patient. 



WO 02/08764 ^^CT/GB01/01176 

24 



In practice it may be advantageous to employ a plurality of different molecules, which 
bind to different HPV proteins. This may be helpful in identifying the type(s) of HPV 
infecting a patient, although it is not essential as a prognostic indicator. For example, the 
5 ability to limit the infecting HPV type(s) to a particular subset (or exclude such a subset) 
may be advantageous. By way of explanation, it is known that mucosal HPV types 6, 1 1, 
42, 43 and 44 are associated with external genital papillomas (condylomata accuminata) 
which have a low risk of progression to cancer, but are difficult to eradicate and are 
disruptive to the lives of the patients. The higher risk mucosal types (31, 33, 35, 51, 52, 
10 58, 61 and 16, 18, 45, 56) cause asymptomatic flat warts (flat concyloma) which can 
progress to high grade cervical intraepithelial neoplasia (CIN) and cancer. The highest 
risk of progression to malignancy is associated with lesions caused by HPV types 16, 18, 
45 and 56. 

15 Different probes specific for different subtypes of HPV may be labelled with different 
labels. For example, high-risk HPV-specific probes may all be labelled with a single 
label; low risk HPV-specific probes may be labelled with a single but different label. The 
test operator would then easily be able to determine the risk associated with the sample 
according to the label. Fluorescent labels would be particularly advantageously employed 

20 in such circumstances. 

Sorting of cells into risk categories according to label may be done in a variety of ways, 
and may be automated, using detectors sensitive to differences in label emissions, such as 
fluorescence frequency and/or intensity. In a particularly advantageous embodiment, cells 
25 may be sorted by FACS; a prevalent distribution of cells into a high-risk group indicates 
the presence of infection by high-risk HPV. 

Molecules which bind to particular HPV types, but not to other HPV types, may be 
generated for example by randomisation and selection techniques. These include phage 
30 display, and other techniques suitable for displaying antibodies or other polypeptides; and 
procedures for generating nucleic acid binding molecules, for example RNA aptamers, 
such as SELEX. These procedures are well known to those of ordinary skill in the art and 
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described below for the purposes of exemplification. The invention accordingly provides 
HPV-binding molecules targeted to the HPV E4 protein, which are useful in methods as 
described herein. 



5 According to the present invention, HPV binding molecules are preferably targeted to 
extracellular portions of the HPV E4 polypeptide. Such portions tend to be hydrophilic in 
character. Preferably, therefore, at least some of the molecules according to the invention 
specifically bind to hydrophilic portions of the HPV E4 protein. 

10 The present invention moreover describes a particular region of the E4 protein to which 
molecules (particularly antibody molecules or variants thereof) may bind with 
considerable specificity. Although homologous regions exist in all HPV E4 proteins, the 
region varies in amino acid sequence between HPVs of different types. The region 
corresponds to a peak of hydrophilicity in the E4 protein and is probably surface-exposed. 

15 The region is highly charged (acid/base-rich). In HPV type 16, the amino acid sequence 
of the region is (from N-terminal to C-terminal) 
RPIPKPSPWAPKKHRRLSSDQDSQTP. Clearly the amino acid sequence of the E4 
proteins of other HPV types will not necessarily be identical to that in type 16, but the 
corresponding region can readily be identified in other E4 proteins by those skilled in the 

20 art by use of conventional alignment and sequence comparison computer programs (about 
65 of the 70 or so known HPV genomes have been cloned and sequenced). 

Thus, the invention also involves an antibody molecule, or an antigen-binding variant 
thereof, which binds specifically to HPV E4 protein in the region of amino acid residues 
25 RPIPKPSPWAPKKHRRLSSDQDSQTP of HPV16 E4 protein, or the corresponding 
hydrophilic, acid/base-rich region of other HPV E4 proteins, preferably other than the 
antibody TVG 402 identified by Doorbar et a!, (1 992 Virology 1 87, 353-359). 



Moreover, the invention involves the use of an antibody molecule, or an antigen-binding 
30 variant thereof, which binds specifically to HPV E4 protein in the region of amino acid 
residues RPIPKPSPWAPKKHRRLSSDQDSQTP of HP VI 6 E4 protein, or the 
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corresponding hydrophilic, acid/base-rich region of other HPV E4 proteins for the 
detection of HPV infections as described herein. 

The corresponding hydrophilic acid/base-rich regions of large numbers of different HPV 
5. types are shown in Figure 9. Figure 9 shows a consensus-type amino acid sequence 
("most likely") on the top row, with the sequence of HPV E4 proteins below. Dots 
indicate gaps introduced to facilitate the alignment, dashes denote amino acid residue 
matches with the consensus sequence. Numbering on the right hand side of the figure 
indicates the number of amino acid residues from the actual or predicted E1 A E4 splice 
10 site. It will be appreciated by those skilled in the art from the alignment that whilst the 
hydrophilicity of the region is conserved amongst different HPV types, the actual amino 
acid sequence varies quite considerably, such that reagents binding to this region may be 
expected to be highly HPV type-specific. 

15 Preferably an antibody suitable for use in the invention has a binding site, as identified by 
the SPOTS epitope mapping system, within the region RRIPKPSPWAPKKHR (or the 
corresponding amino acid sequence from other HPV types). A particularly preferred 
molecule is the Fab fragment TVG405, described further below, which binds to the 
epitope PKPSPWAPKKH(R) with extremely high affinity and is of particular usefulness 

20 in the methods of the invention defined above. 

The arginine residue indicated in brackets at the C-tenninal of the TVG405 epitope is not 
essential for high affinity binding. 

25 The Fab fragment TVG405 was isolated by the present inventor using phage display 
technology, as described below. Those skilled in the art will understand that different 
antibodies or Fab fragments may readily be obtained by using similar phage display 
techniques (and screening with relevant polypeptides or portions thereof), or by using 
more conventional immunisation techniques (e.g. immunising mice, rabbits, rats or the 

30 like with HPV or cell proliferation proteins, or peptides corresponding to portions of such 
polypeptides) to obtain polyclonal antisera or monoclonal antibodies (using well known 
hybridoma techniques of Milstein et al). Complete antibody molecules can readily be 
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prepared from Fab - encoding sequences (e.g. isolated by phage display techniques) using 
standard DNA manipulation techniques described by Sambrook et al, (Molecular 
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, NY, 
USA) to join appropriate DNA sequences. 

Similarly, standard DNA manipulative techniques can be used to modify DNA sequences 
encoding antibodies or antigen-binding variants thereof. In particular site-directed 
mutagenesis or PCR can be used to modify the coding sequences, so as to produce 
modified anti-HPV or anti-cell proliferation marker antibodies with different binding 
specificities or affinities. Alternatively fusion proteins, comprising the binding site of an 
Fab, Fv or antibody and the like, may be prepared. 



Molecules capable of binding HPV polynucleotides or polypeptides or cell proliferation 
markers may be used as anti-viral or anti-cancer agents, or parts of such agents. For 
15 example, antibody molecules or E4-binding peptide as described above may be employed 
for this purpose. Preferably, however, the E4 protein and/or molecules capable of binding 
thereto may be used to design E4-binding molecules, preferably small molecules, by 
rational drug design. The invention moreover concerns the use of molecules capable of 
binding to E4 to target antiviral agents capable of destroying papilloma viruses and/or 
20 cells infected by papilloma viruses. Such molecules may be antibodies or peptides as 

described above and exemplified herein, optionally conjugated to anticancer or antiviral 

agents. 

Such a process preferably involves the crystallisation of the relevant polypeptide or a 
25 molecule capable of binding thereto. More preferably, such a process involves the co- 
crystallisation of the polypeptide and a binding agent. Such a procedure gives 
information concerning the interaction between the polypeptide and the binding molecule, 
which can be used to design small molecules capable of niimicking the binding 
interaction. 



30 



Crystallisation involves the preparation of a crystallisation buffer, for example by mixing 
a solution of the peptide or peptide complex with a "reservoir buffer", preferably in a 1:1 
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ratio, with a lower concentration of the precipitating agent necessary for crystal formation. 
For crystal formation, the concentration of the precipitating agent is increased, for 
example by addition of precipitating agent, for example by titration, or by allowing the 
concentration of precipitating agent to balance by diffusion between the crystallisation 

5 buffer and a reservoir buffer. Under suitable conditions such diffusion of precipitating 
agent occurs along the gradient of precipitating agent, for example from the reservoir 
buffer having a higher concentration of precipitating agent into the crystallisation buffer 
having a lower concentration of precipitating agent. Diffusion may be achieved for 
example by vapour diffusion techniques allowing diffusion in the common gas phase. 

10 Known techniques are, for example, vapour diffusion methods, such as the "hanging 
drop" or the "sitting drop" method. In the vapour diffusion method a drop of 
crystallisation buffer containing the protein is hanging above or sitting beside a much 
larger pool of reservoir buffer. Alternatively, the balancing of the precipitating agent can 
be achieved through a semipermeable membrane that separates the crystallisation buffer 

1 5 from the reservoir buffer and prevents dilution of the protein into the reservoir buffer. 

In the crystallisation buffer the peptide or peptide/binding partner complex preferably has 
a concentration of up to 30 mg/ml, preferably from about 2 mg/ml to about 4 mg/ml. 

20 Formation of crystals can be achieved under various conditions which are essentially 
determined by the following parameters: pH, presence of salts and additives, precipitating 
agent, protein concentration and temperature. The pH may range from about 4.0 to 9.0. 
The concentration and type of buffer is rather unimportant, and therefore variable, e.g. in 
dependence with the desired pH. Suitable buffer systems include phosphate, acetate, 

25 citrate, Tris, MES and HEPES buffers. Useful salts and additives include e.g. chlorides, 
sulphates and further salts specified in Example 1. The buffer contains a precipitating 
agent selected from the group consisting of a water miscible organic solvent, preferably 
polyethylene glycol having a molecular weight of between 100 and 20000, preferentially 
between 4000 and 10000, or a suitable salt, such as a sulphates, particularly ammonium 

30 sulphate, a chloride, a citrate or a tartrate. 
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A crystal of a peptide of interest, or a peptide/binding partner complex according to the 
invention may be chemically modified, e.g. by heavy atom derivatization. Briefly, such 
derivatization is achievable by soaking a crystal in a solution containing heavy metal atom 
salts, or a organometallic compounds, e.g. lead chloride, gold thiomalate, thimerosal or 
uranyl acetate, which is capable of diffusing through the crystal and binding to the surface 
of the protein. The locatibn(s) of the bound heavy metal atom(s) can be determined by X- 
ray diffraction analysis of the soaked crystal, which information may be used e.g. to 
construct a three-dimensional model of the peptide. 

A three-dimensional model is obtainable, for example, from a heavy atom derivative of a 
crystal and/or from all or part of the structural data provided by the crystallisation. 
Preferably building of such model involves homology modelling and/or molecular 
replacement. 

The preliminary homology model can be created by a combination of sequence alignment 
with any of the proteins of which the sequence is known, secondary structure prediction 
and screening of structural libraries. For example, the sequences of HSV 16 and 34 E4 
can be aligned as set forth herein. 

Computational software may also be used to predict the secondary structure of peptides or 
peptide complexes. The peptide sequence may be incorporated into the structure. 
Structural incoherences, e.g. structural fragments around insertions/deletions can be 
modelled by screening a structural library for peptides of the desired length and with a 
suitable conformation. For prediction of the side chain conformation, a side chain rotamer 
library may be employed. 

The final homology model is used to solve the crystal structure of peptides by molecular 
replacement using suitable computer software. The homology model is positioned 
according to the results of molecular replacement, and subjected to further refinement 
comprising molecular dynamics calculations and modelling of the inhibitor used for 
crystallisation into the electron density. 
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Similar approaches may be used to crystallise and determine the structure of HPV-binding 
or cell proliferation-binding polypeptides, including antibodies and antibody fragments, 
for example those used in the present invention. 

5 It has surprisingly been found that E4 expression correlates strongly with vegetative DNA 
replication in HPV-infected cells, making detection of E4 expression a particularly 
appropriate indicator of HPV infection, and thus particularly useful in screening for 
precancerous cervical lesions. 

10 Other available methods of cervical screening by HPV detection are based on DNA 
hybridisation. They involve cell lysis or penneabilisation and are performed in an 
ELISA-type 96 well format. The hybridisation is ultimately visualised as a colour change 
in one of the wells. 

15 Although the antibodies of the present invention could be used in a similar way (i.e. 
following cell lysis), they are amenable to a quicker procedure which would be more 
readily carried out routinely by histopathology laboratories. Samples comprising cervical 
cells may be taken as usual. These are be spread for example on a microscope slide or 
other support using techniques known in the art, for example as exemplified herein, and 

20 stained with, for example, an anti-E4 Fab. Detection may be performed with a secondary 
antibody-enzyme conjugate (horseradish peroxidase, alkaline phosphatase), or the Fab 
could be directly conjugated, for example to a fluorophore, such as FITC. This approach 
may be adapted for use with systems that are currently available for increasing the 
sensitivity of antibody detection. At present, cervical smears are examined routinely by 

25 microscopy. The proposed approach would require no new equipment and could easily fit 
around existing methods. 
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STAINING OF CERVICAL SMEARS 

One protocol for antibody staining of cervical smears is as follows. 
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BSA for 1 hour. solutioIL 
|. « :„ secondary antibody, malce up 
„, ivuiip slides are m secuuu»j 

25 Willie suu for 5 minutes. 

Place slides in endogenous peroxioas 
10 minutes. 

Wash in TBS for 5 minutes. 

30 ssr.— — — -- — - 

SABC solution for 30 minutes. 
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Tim sfer ^ racks washtaTBS in S 
Repeat step 22. 

Develop in DAB solution for 10 minutes. 
WashmrurmmgtapwaterforSminutes. 

Washmrunningtapwaterforlnunute. 
^ntiateinO^ohydrochloncacidforl^seconds. 

Washmrunning tap water for 5 minutes. 
Rinse in 50% methanol for 2 minutes. 
10 Rinsein70%methanolfor2minutes. 
Rinse in 90% methanol for 2 minutes. 

Rinse in 100% methanol for 2 minutes. 

P la cemOrangeGwor^gsolutionfor2minute, 

Rinseinl00%methanolfor7secondsandag 1 tate g ently. 

15 Repeat step 35. 

PlaceinEA50solutionfor2minutes. 
. Rinsein^/omemanolforTsecondsandagitategentiy. 

Repeat step 38. 

Place sUdes in xylene to clear for 5 minutes. 

20 Repeat step 40 twice. 

Apply covershps using DEPEX mountant. 

Sm ears for inununotluorescence may be ^^^Z^^^ 
secondar y antibody, they are mcuhated in A *+ Sigma at 50 

. , fnT nNA with propdium wdide/RNAse «. ^ 

ng/ml), then washed and mountea in b y 

Preferred binding molecules for use m aspect . j . . 

30 

• ovidesanewapproachtoscreeningforcervicalabnormahties. 
The present invention provides a new 
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The pr ior art in this field comprises diagnostic tests for the presence of papilloma viruses 
which were previously developed by us (eg. W098/25145). These rely on detection of 
viral constituents, principally using antibodies, and use this information about the state of 
5 viral infection as an indication of the likelihood of developing higher grade lesions or 
malignancies preceding cervical cancer. 

A different and unconnected prior art document teaches an approach to screening for 
malignancies by examining tissue samples such as blood, biopsies, body fluids etc, 
10 including cervical samples, for abnormally proliferating cells (WO99/21014). This relies 
on the detection of human cell cycle expressed polypeptides, which are usually 
intraceUular, and provides an indication of the presence of human cell lineages which are 
proliferating ectopically. 

15 Clearly, although these two approaches share some biochemical elements such as the use 
of immunological reagents, they are very different in principle and in practice. The 
former is based on collecting of information about viral life cycles, and characterisation of 
particular sub-types of viral infection, and relating this to the probability of more serious 
lesions developing. The latter test, on the other hand, is strictly concerned with the 

20 monitoring of the cell division state of human cells from various sources, and aims to 
relate this information to the presence or absence of malignancies in the sample. 

The present invention combines these two different tests to provide a much simpler 
approach which yields more information more quickly than would conventional pap- 
25 smear testing programmes, or than either test applied individually. - 

The present invention is founded in double and triple stains in which antibodies to E4 
have been combined with antibodies to genes such as PCNA, MCM and cyclinA which 
are present in cervical lesions because they are induced by E7. Thus these proteins can be 
30 considered to be surrogate markers of E7 activity. The prior art does not connect the 
expression of MCM in cervical lesions with the fact that such genes are, in effect, 
markers of E7 activity. The analysis of E4 and MCM in productive infections caused by 
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HPV1, HPV2 and HP VI 1 reveals a common pattern of expression in productive infection 
which changes in abortive infections caused by HPV16. This could not have been 
predicted from previous studies which did not make use of the double staining approach. 

5 It is an advantage of the present invention that the same sample can be used to provide 
much more information than a single individual test might. 

By screening for viral infection and for higher grade malignancies in the same test, only 
one sample needs to be obtained for a much broader picture of the patient's condition to 
10 be obtained. 

By screening for 'early' events which may lead to cervical lesions (i.e. HPV infections) 
whilst simultaneously examining the sample for later stages of malignancy, a better 
prognosis may be obtained. 



Furthermore, it is a known problem that conventional smear testing can produce a high 
proportion of false negatives, whereas by combining two entirely different tests into the 
single procedure of the present invention, the chances of such false negatives occurring is 
much reduced. 



A further advantage of the present invention is that fewer samples are needed to gain the 
same amount of information. For example, if a patient was discovered to have a HPV 
infection, the doctor may need to recall that patient for further examination, whereas by 
employing the methods of the present invention, more information is gained in the first 
25 instance, thereby reducing the burden on staff performing the tests, and those interpreting 
them. 

Moreover, patients do not generally find the smear test procedure comfortable. Many find 
it embarrassing and unpleasant, and therefore by using the methods of the present 
30 invention, which may alleviate the need for some repeat or further testing, patient 
suffering may be directly reduced by requiring fewer samples to be collected. 



15 



20 
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By simultaneously testing the same sample for more than one condition, there is 
introduced into the procedure a robust means for cross-comparison between the tests. For 
example, relative staining intensities might be meaningfully compared where they might 
not have been if the tests had been performed independently on different samples. This is 
another example of the synergistic advantages offered by the methods of the present 
invention. 



One of the problems in cervical screening, particularly in developed countries, is the often 
long periods which some women experience between tests. This could mean that by 
10 testing for only early stages of the disease, these may have already passed by the time a 
sample is examined. Equally, if only the late stages of the disease are monitored, then a 
patient may be correctly given a negative test, but develop the disease during the 
intervening period, and be at a dangerously advanced state at the next test some 5 years or 
more later. 

15 

Therefore, it is clearly advantageous to employ the methods of the present invention 
which allow for simultaneous assaying of both preceding conditions such as HPV 
infection, as well as later stages such as abnormal cellular proliferation. In this way, 
according to the present invention, whenever a patient presents for testing, the- disease 
20 should be detected whether it is in its infancy, or at a more critical advanced state. 

Further, by combining, the testing procedures according to the present invention, the 
doctor is presented with a better overall picture of the condition of the patient. It is 
possible to have only HPV infections, or to have an isolated advanced lesion, or to have 
25 numerous different abnormalities at different stages of development. By combining the 
tests according to the present invention, it is substantially easier to gain an appreciation of 
the condition of the patient, and to properly plan a course of appropriate treatment. 

" In addition, it is an advantage of the present invention that the necessity for complicated 
30 interpretation of histological samples is significantly alleviated. The methods of the 
present invention employ molecular tests which are easier to score than conventional 
smear tests which require a highly trained technician to judge the state of the cells by their 
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appearance. Molecular tests according to the present invention may be scored in a simple 
positive or negative fashion, rarely requiring a difficult judgement to be made. Indeed, it 
is an advantage of the present invention that it lends itself to straightforward automation. 
It would be possible to score the tests using a fluorescent reader or similar device which 
5 could be programmed to read at two or more different wavelengths and therefore provide 
information as to the various possible states of the sample, as opposed to a single state as 
prior art methods provide. 

It is envisaged that the standard method of detection may be modified/ Antibody binding 
10 may be carried out while the cells are in suspension, with cells being spun down prior to 
analysis. This would improve the quality of the screen. 

Considerable effort in diagnosis is aimed at automating screening methods. The use of 
antibodies or antigen-binding variants thereof for HPV or cell proliferation marker 
1 5 detection greatly facilitates this. 

The invention will now be described by way of illustrative examples . 
20 Example 1 

Preparation of Anti-E4 monoclonal and polyclonal immunoglobulins 

Although Mabs.against HPV16 E1 A E4 have been described previously (TVG401, 
402, 403; Doorbar et al, 1992) these reagents recognise a single overlapping epitope at the 
major antigenic site of E4, and have been reported not to detect the protein in archival 
25 tissue biopsies (Doorbar et al, 1992). 

Although these results suggest that E4 may not be a candidate for immunological 
detection of HPV, further antibodies are generated targeted at the N and C termini of 



The generation of further Mabs by standard hybridoma technology results in the isolation 
of TVG404, an IgM which recognises an epitope at the very C-terminus of the protein. 



HPV16E4. 



30 
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To complement this reagent polyclonal antiserum to the N-terminus of the protein is 
raised against an N-tenninal synthetic peptide (E4 N term). Polyclonal antibodies (to 
HPV16 and HPV63 E4 proteins) are prepared by immunisation of rabbits with maltose 
5 binding protein E4 fusion protein (MBP-E4). Antibody litres are monitored in ELISA 
using purified glutathione S transferase E4 fusion protein (GST-E4). 

Antibodies to the N-terminus of the protein are raised against the synthetic peptide 
MADPAAATKYPLC after conjugation to thyroglobulin or keyhole limpet haemocyanin 
10 through its C-terminal cysteine residue. Conjugation is carried out using 
m-Maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) as described by Green et al 



Antibody specificities are confirmed by epitope mapping, as follows: the HPV 16 E4 
15 protein is synthesised as a series of 85 overlapping octamers (single amino acid overlap) 
by solid phase fmoc chemistry using the SPOTS epitope mapping system (Genosys 
Biotechnologies, Cambridge, UK). Accuracy of synthesis is confirmed using the HPV 16 
E1 A E4 monoclonal TVG402 which binds the major antigenic site of the protein (Doorbar 
et al 9 1992). Filters are regenerated as described by the manufacturers and antibody 
20 binding is visualised by ECL (Amersham, Little Chalfont, UK). Polyclonal serum is used 
at 1/250 dilution, purified Fabs at approximately 1 g/ml, and hybridoma supernatant at 
1/10 dilution. 

In Figure 1 A the sequences of the 85 overlapping E4 synthetic peptides are shown at the 
25 top of the figure, and the results of the epitope mapping experiments are shown below. 
The dark spots represent binding of the antibody to the synthetic peptide shown above it. 
Only the portion of the filter containing peptides which react with each antibody are 
shown. 



(1982). 



30 In Figure IB, the locations of epitopes on the E1 A E4 amino acid sequence are summarised 
above the HPV 16 sequence. Alignment with a consensus E4 sequence prepared by 
comparison of 70 putative E1 A E4 sequences (Doorbar and Myers, 1996b) is shown 
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beneath the sequence of HPV16 E1 A E4, and the sequence of the HPV1 E1 A E4 protein is 
shown beneath this. The binding sites of the existing HPV1 E1 A E4 Mabs (Doorbar et ah 
1988) are shown beneath the HPV1 sequence. The proteolytic cleavage sites that give rise 
tothe 16Kand 10/1 IK gene products in the E1 A E4 protein ofHPVl (Doorbar a/, 1988; 
Roberts et al, 1994) are indicated beneath the HPV1 sequence allowing prediction of 
putative sites in the E1 A E4 sequence of HPV16. 



Example 2 

1 0 Preparation of Synthetic Immunoglobulins 

Fabs are isolated from a synthetic antibody displayed on fd bacteriophage 
(Griffiths et al, 1994) as described below. Immunotubes (Life Technologies, Paisley, UK) 
are coated overnight at 4°C with either MBP-E4 or GST-E4 at a concentration of 0.1 g/ml. 
These are subsequently blocked at 37°C for 1 hour in PBS/2% marvel™ prior to 

15 incubation in the presence of 10 u phage on a blood tube rotator (37°C). Unbound phage 
are poured off and tubes are washed lOx with PBS/0.1% Tween 20. Binders are eluted 
with lOOmM triethylamine pH 11.0 (1ml) and immediately neutralised with 1M Tris 
(pH8.0) before being reintroduced into E. coli TGI cells. The enriched library is grown up 
and the selection procedure repeated three more times. 

20 

Phage selections are carried out alternately against GST 16 E1 A E4 and MBP 16 E1 A E4 in 
order to prevent isolation of antibodies to MBP or GST protein, using a repertoire of 6.5 x 
10 10 (Griffiths et al, 1994). MBP 16 E4 is expressed at higher levels (>50mg/litre of 
bacteria) than the GST fusion (approx. 5mg/litre of bacteria) but, in any event, antibody 

25 isolation requires as little as 1 g of antigen (Hawkins et al, 1992). Phage displaying 
antibodies with affinity for E4 are identified by ELISA (against GST-E4, MBP-E4, GST 
and MBP), and activity is confirmed by phage western blotting* Immunoglobulin genes 
are transferred from the isolated phage into the bacterial expression vector 
pUCl 19.His.myc (Griffiths et al 1994) and soluble Fabs are purified from the periplasriiic 

30 space of induced bacteria by Nickel-NTA chromatography (Qiagen, Crawley UK). 
Antibody titres are established by ELISA. 
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After four rounds of selection, individual clones are examined for their ability to bind 
either E1 A E4, unfitted GST or MBP, or bovine serum albumin (BSA). 47 clones (out of 
100) are able to bind MBP 16 Ei A E4, of which 39 could also bind GST 16 E4. None of 
these clones interacted with BSA, GST or MBP. BsM fingerprinting (Marks et d t 1992: 

5 Nissim et a£ 1994) revealed three distinct Fabs amongst these clones, and their 
immunoglobulin genes are subcloned into the prokaryotic expression vector 
pUC119His.6myc to allow the production of soluble anti-E4 Fabs (Griffiths et al, 1994). 
Approximately 5mg (per litre of bacteria) of anti-E4 Fab (TVG 405, 406 and 407) can be 
extracted from the periplasmic space of induced bacteria and all are found to specifically 

10 detect E1 A E4 by ELISA and western blotting. Fab TVG 407 binds an epitope which is 
identical to that recognised by the hybridoma-derived Mab, TVG 409 (Fig 1). The 
remaining synthetic Fabs recognise novel epitopes upstream (TVG 405) or downstream 
(TVG 407) of this major antigenic region of E4 and the results are summarised in Figure 
1. 

15 

It is found that the amino acid sequence of the CDR3 loops of the TVG 405 and TVG 407 
Fabs are as follows: 

TVG 405 heavy chain CDR3 sequence: LLRGAFDY 
20 light chain CDR3 sequence: NSRDSSGGNAV 



TVG 407 heavy chain CDR3 sequence: LVQGSFDY 
light chain CDR3 sequence: QADSSTHV 

25 Measurement of Antibody Affinity 

Affinities of synthetic (TVG405, TVG406 and TVG407), and hybridoma-derived 
Fabs (TVG402) are analysed by surface plasmon resonance using a BIAcore 2000 
instrument (Pharmacia Biosensor, St. Albans, UK) as described by the manufacturer. 
MBP-E4 aggregates are dissociated under reducing conditions in 0.5% SDS, ImM ji- 

30 mercaptoethanol, 50mM Na 2 C0 3 /NaHC0 3 (pH 8.5) and biotinylated using NHS- 
LC-biotin (Sigma, St Louis, USA; 25mg/ml in DMSO) at a biotin:protein molar ratio of 
6:1 (Johnson et al, 1991). Monomeric MBP-E4 is recovered by FPLC chromatography 
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using a Superdex S200 HR10/3O column run in 6M Urea/lmM P~ 
mercaptoemanol/PBS/0.2mM EDTA (pH7 2), before being bound to a streptavidin-coated 
sensor chip and "refolded" in vitro in PBS/0.2mM EDTAyO.lmg/ml protease-free BSA 
(Sigma). Fabs are isolated from purified TVG402 using an Immunopure Fab kit (Pierce, 
5 Rockford, USA), and monomeric preparations are obtained by FPLC gel chromatography 
(Superdex S200 HR10/30 column run in PBS/0.2mM EDTA (pH 7.2)) Sensor chip 
surfaces are regenerated using 6M urea column buffer (described above). On and off rates 
are derived by non linear curve fitting using the proprietary 'BIAanalysis' software. 

10 Binding activities are in the order of 20% of total protein levels for the bacterially-derived 
antibodies, and 50% for Fabs derived from hybridoma culture supernatant. The affinities 
of TVG405 and TVG402 are calculated from on- and off-rates obtained by non-linear 
curve fitting to sets of BIAcore binding curves. 

15 Figure 2A shows an overlay of binding curves (sensograms) obtained after passing Fab 
TVG405 over a BIAcore chip coated with MBP-E4 fusion protein as described above. 
Fab concentrations range from lOmM (lowest curve) to 300nM (upper curve) through 5 
intermediate dilutions. The extent of binding is indicated in resonance units on the X- 
axis, against time in seconds on the Y-axis. Purified Fab is injected at around 100 

20 seconds and washing initiated at 700 seconds. The affinity QQ of TVG405 is calculated 
as between 0.3 and 1.25nM by analysis of the association and dissociation curves using 
BIAevaluation software (Pharmacia, UK). 

Figure 2B shows an overlay of binding curves (as described above) for the hybridoma- 
25 derived Fab TVG402 over a concentration range lOOnM to 1 M. The Kd is estimated as 
70nM. 

TVG405 has an association rate constant (M of 1.8 x 10 6 M A .s l and an off rate (W) of 
2 x 10 3 s 1 indicating a molar dissociation constant (Kd) of approximately InM. The best 
30 hybridoma-derived antibody - TVG402 - has an affinity of only 70nM, and had a ko„ of 
4.2xl0 4 MV and a k* valued 3xl0 3 s \ TVG 406 and "407 display rapid kinetics and 
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are thus examined by Scatchard analysis of equilibrium binding data, as shown for 
TVG407. 

Figure 2C shows the equilibrium binding curve of Fab TVG407, which displays rapid 
5 kinetics. Figure 2D shows Scatchard analysis of the data presented in Fig. 2C using 
BIAevaluation software. Equilibrium values are corrected for bulk refractive index 
changes by subtracting values from a surface blocked with biotin, from the values shown 
in Fig. 2C. In the plot shown the slope is - K<i and the Y-axis intercept is TUx', i e. the 
binding level at saturation with Fab. The uncorrected K<, values for TVG407 and 
10 TVG406 are 250nM and 140nM which, when the activity of the Fab preparation is 
considered, indicates actual affinities of 50nM and 28nM. 

TVG407 has an affinity (Kd) of 50nm after correction for biological activity, and 
TVG406 has an affinity (Kd) of 28nM. The amino acid sequence of the heavy and light 
15 chain CDR3 loops are established by DNA sequencing, further confirming that the three 
antibodies are distinct. 



20 



25 



Example 3 

Preparation ofAnti-E4 Peptides 

A commercially available two-hybrid screening kit is purchased from ClonTech 
and employed for identifying naturally occurring E4-binding peptides, according to the 
instructions given by the manufacturer. A HeLa cDNA library, obtained from the same 
supplier, is screened. By this method, seven DNA sequences are isolated which encode 
E4-binding polypeptides, of which three are identified after sequencing. 

The first peptide is ferritin. 

The second peptide is a keratin filament binding protein, which has the sequence set forth 
inSEQ.ID.No.2. 



30 
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The third polypeptide is a novel polypeptide recognised as a member of the DEADbox 
family of proteins, which contain the characteristic sequence motif DEAD. The sequence 
of the third polypeptide is shown in SEQ. ID. No. 3 . 

5 In order to identify the site of interaction between these polypeptides and E4, a series of 
overlapping peptides of between 10 and 20 amino acids in length is generated by PCR and 
displayed on phage as described above. The binders are subsequently employed as 
screening agents to identify HPV16 in mucosal lesions. 

10 Example 4 

Preparation of Anti-E4 RNA oligonucleotides 

RNA oligonucleotides, known as aptamers, which are capable of specific binding 
to target molecules can be generated by selection procedures such as SELEX. The SELEX 
method involves selection of nucleic acid aptamers, single-stranded nucleic acids capable 

15 of binding to a desired target, from a library of oligonucleotides. Starting from a library 
of nucleic acids, preferably comprising a segment of randomised sequence, the SELEX < 
method includes steps of contacting the library with the target under conditions favourable 
for binding, partitioning unbound nucleic acids from those nucleic acids which have 
bound specifically to target molecules, dissociating the nucleic acid-target complexes, 

20 amplifying the nucleic acids dissociated from the nucleic acid-target complexes to yield a 
ligand-enriched library of nucleic acids, then reiterating the steps of binding, partitioning, 
dissociating and amplifying through as many cycles as desired to yield highly specific, 
high affinity nucleic acid ligands to the target molecule. 

25 DNA Oligonucleotide Library 

DNA oligonucleotides 73 bases in length, having a randomised portion of 26 
bases, are used for the development of an aptamer capable of binding E4. A library of 
synthetic RNA oligonucleotides having the following structure is prepared: 

30 5' CCTGTTGTGAGCCTCCTGTCGAA(26N)TTGAGCGTTTATTCTTGTCTCCC 3' 
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Where N stands for any possible base in the random region. The random region is 
generated by using a mixture of all four nucleotides (ratio 6:5:5:4, A:C:G:T, to allow for 
differences in coupling efficiency) during the synthesis of each nucleotide in that stretch 
of the oligonucleotide library. The resulting complexity is theoretically 4 26 molecules. 
5 The scale of synthesis (O.lumol) followed by gel purification yields 8.8nmol which puts 
an absolute upper limit of approximately 5xl0 15 on the number of different molecules 
actually present. 

PCR Amplification with a 5' primer that introduces the recognition site for T7 RNA 
10 Polymerase (5 1 TAATACGACTCACTATAGGGAGACAAGAATAAACGCTCAA 3') 
and 3'primer (5' GCCTGTTGTGAGCCTCCTGTCGAA 3') results in the following 
template for transcription: 

5' TAATAGCACTCACTATAGGGAGACAAGAATAAACGCTCAA (26N) 
15 TTCGACAGGAGGCTCACAACAGGC 3' 

The RNA transcript itself has the following sequence: 

5 . GGGAGACAAGAAUAAACGCUCAA (26N) 

20 UUCGACAGGAGGCUCACAACAGGC 3' 

Aati,E4 aptamers are selected using a conventional SEL,EX procedure as described in US 
Patent 5,270,163. Each round consists of the following steps: 

25 1) Selection. The RNA and E4 protein are mixed, incubated at 37° C., washed through a 
nitrocellulose filter, and RNA is eluted from the filters. 

2) Amplification. The RNA eluted from filters is extended with AMV reverse 
transcriptase in the presence of 50 picomoles of 3' primer in a 50 pi reaction under 
30 conditions described in Gauss et al. (1987). To the resulting cDNA synthesis 50 
picomoles of 5' primer is added and in a reaction volume of lOOul and amplified with 
Taq DNA polymerase as described in Innis (1988) for 30 cycles. 
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3) Transcription. In vitro transcription is performed on the selected amplified templates 
as described in Milligan et al. (1987), after which DNasel is added to remove the DNA 
template. The resultant selected RNA transcripts are then used in step 1 of the next 

5 round. Only one-twentieth of the products created at each step of the cycle are used in the 
subsequent cycles so that the history of the selection can be traced. The progress of the 
selection method is monitored by filter binding assays of labelled transcripts from each 
PCR.reaction. After the fourth round of selection and amplification, the labelled selected 
RNA products produce binding to E4. The binders are used in the detection of HPV in 

1 0 cells derived from cervical smears . 



Example 5 

Detection of HPV in Cutaneous and Mucosal Lesions 
15 All the synthetic Fabs detect the HPV16 E1 A E4 protein in formalin fixed 

paraffin-embedded tissue, although TVG405 consistently show the highest level of 
staining (Figure 3). 

Figure 3 illustrates the use of synthetic Fabs to localise HPV16 E4 protein in vivo by 
20 iinmunostaining of a low grade HPV16 CIN I with Fab NIP-C11 (Griffiths et al, 1994), 
which has no reactivity towards HPV16 E4 (Fig. 3A), and the E4-specific Fab TVG405 
which is described here (Figs. 3B, C, D). Fabs are detected using 9E10 as secondary 
antibody followed by anti-mouse FITC conjugate. E4 is detectable in the upper layers of 
the epidermis but at greatly varying levels between different lesions with often only a few 
25 positive cells being apparent (C, D). The position of the basal layer is arrowed in C and 
D. Magnification is 200X. 

Epitope exposure by microwave treatment enhances the sensitivity of E4 detection, and 
even allows staining using TVG402 (Doorbar et al, 1992). The extent of E4 expression 
30 varies greatly between different lesions (8 HPV16-associated CIN1 biopsies are 
examined), ranging from expression only in rare cells scattered throughout the biopsy 
(Fig. 3), to widespread distribution throughout the most differentiated layers of the 
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epidermis (Fig. 4), comparable to the distribution of E4 in cutaneous warts caused by 
HPV1 and HPV63 where the production of infections virions is also high (Fig. 4). In low 
grade cervical intraepithelial neoplasia (ON 1) caused by HPV16, E4 and LI levels are 
also found to correlate closely, although expression of the two proteins is not coincident 

5 (as previously suggested (Brown et al, 1994). E4 expression precedes the synthesis of the 
major capsid protein by several cell layers (as revealed by double staining, see Fig. 4) and 
in high grade cervical lesions (CIN 2/CIN 3) E4 is often abundant, even though the 
expression of LI is no longer supported (Fig 4). This time delay between the 
commencement of E4 synthesis and the assembly of infectious virions is most apparent in 

10 HPV63, where E4 expression coincided with migration of an infected basal cell into the 
parabasal layers, while expression of LI is restricted to a narrow strip of cells in the upper 
granular layer. 

Figure 4 demonstrates that synthesis of E4 is not directly linked to the expression of 
1 5 capsid proteins in high and low grade HPV16 lesions, and benign warts. Figure 4 shows 
the results of triple staining using anti LI antisera (Figs. 4A, D, G), HPV16 E4 Fab 
TVG405 (Figs. 4B and 4E), polyclonal antisera to HPV63 E4 (Fig. 4H), and with DAPI 
(Figs. 4C, F, I). A, B and C represent a low grade HPV16-induced lesion (CIN I). D, E 
and F represent a high grade HPV16-induced lesion (CIN MH). G, H and I represent a 
20 section through a verruca caused by HPV63. In all cases E4 expression precedes LI 
expression although by only a few cell layers in CIN I (A, B). In the CIN MH we assume 
that terminal differentiation is insufficient to support synthesis of virion structural 
proteins (D) although E4 expression is abundant (E). The contrast between the onset of 
E4 expression and the detection of virus structural proteins is most apparent in cutaneous 
25 verrucas caused by HPV63 (G, H). The basal layer is indicated by an arrow on the DAPI- 
stained images. Magnification is 100X. 



Onset of vegetative viral DNA replication and expression ofE4 coincide 

Vegetative viral DNA replication is found to begin in cells of the mid spinous 
30 layer and to correlate exactly with the onset of E4 expression (Fig 5). 
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Figure 5 demonstrates that onset of vegetative viral DNA replication coincides with E4 
expression in low grade HPV16 lesions and in benign cutaneous warts. The figure shows 
triple staining using the HPV16 E4 antibodies TVG402, 405 and 406 (Fig. 5A) and HPV1 
E4 antibodies 4.37 and 9.95 (Fig. 5D), biotinylated DNA probe (Fig. 5B - HPV16, Fig. 5E 
5 - HPV1), or DAPI (Figs. 5C and F). A, B and C represent a section through an HPV16- 
induced ON I, and D, E and F represent a section through an HPVl-induced verruca. In 
the HPV16 ON I, vegetative viral DNA replication and E4 synthesis correlate in the mid 
to upper layers of the epidermis (A, B). In cutaneous lesions the two events are initiated . 
as soon as the infected cell leaves the basal layer (D, E). Basal cells are illustrated in the 
10 DAPI counterstained image (F). Magnification is 200X. 

In HPV 1-induced warts vegetative viral DNA replication and E4 synthesis commence 
much earlier, and are evident immediately after the infected basal cell migrates into the 
superficial layers (Fig 5). Only a proportion of the differentiating cells are permissive for 

15 vegetative viral DNA replication, and only in these cells is E4 detectable. Neighbouring 
cells showed neither late gene expression nor vegetative viral DNA replication, 
suggesting that onset of the two events is closely linked. Although the sensitivity of DNA 
and E4 detection is not established, these 'normal' cells are likely to be either 
non-permissive for viral replication or be uninfected. This precise correlation between E4 

20 expression and the onset of vegetative viral DNA replication is also seen in cutaneous 
warts caused by HPV63 and 65, and in common warts caused by HPV2. 

Cells undergoing late gene expression show an abnormal pattern of terminal 
differentiation when compared to non-permissive or uninfected cells 

05 Cells supporting the late stages of HPV infection can thus be identified by 

immunostaining with Fab TVG405 (for HPV 16) Mab 4.37 (for HPV1) or polyclonal 
antisera to E4 (HPV63). In warts caused by HPV1, Expositive cells lack detectable 
levels of filaggrin or involucrin (Fig 6(i)). Non-permissive (or uninfected) cells in the 
same lesion which show neither E4 expression nor vegetative viral DNA replication, 

30 express filaggrin and loricrin at levels indistinguishable from those in the surrounding 
epidermis. Correlation of E4 synthesis with the differentiation-specific keratins K4 and 
K13 reveals an identical pattern of inhibition. The intensity of K4 and K13 staining is 
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always lower in E4-positive cells than in neighbouring cells that are not expressing E4 
(Fig.6(ii)). K5 and 14, which are present in the basal and lower parabasal cells, are 
unaffected. This interference with the detection of expression differentiation-specific 
keratins (Kl and K10 in cutaneous skin) is also apparent in cutaneous warts caused by 
5 HPVI (Fig 6(ii)) but is not evident in warts caused by HPV63 (Fig 6(ii)). The E4 protein 
of HPV63 is most closely related to that of HPVI . 

Figure 6 illustrates that productive infection interferes with normal epithelial terminal 
differentiation in low grade HPV16 lesions and in benign cutaneous warts. Figure 6(i) 
10 (keratin expression) shows triple staining using the HPV16 E4 Fabs TVG405/TVG406 
(Fig. 6(i)A), HPV 1 E4 monoclonals 4.37/9.95 (D), and HPV63 E4 polyclonal antibodies 
(G), in conjunction with antibodies to the differentiation-specific mucosal keratins 4 and 
13 (B) or cutaneous keratins 1 and 10 (E, H). Figures 6(i) C, F and I show the DAK 
counter stain. A, B and C represent a section through a HPV16-induced CIN I. D, E and 
15 F show a section through the edge of an HPVl-induced verruca, while Figures 6(i) G, H 
and I show a section through an HPV63-induced wart. In HPV 16. and HPVl-induced 
lesions, differentiation-specific keratins are less apparent in Expositive cells than in 
neighbouring cells (A, B, D, E) although this is not the case with HPV63 (G, H). Nuclear 
degeneration (visualised by DAPI counter staining) is retarded in Expressing cells (A 
20 C,D,F). Magnification is 200X. 

Figure 6(ii) relates to filaggrin expression. The figure shows triple staining, as described ; 
above, except that Figures 6(ii) B and E show filaggrin staining. E4 staining is shown in 
figures 6(ii) A and D, and DAPI counter staining is shown in figures 6(ii) C and F. A, B 
25 and C represent the edge of an HPV63-induced wart where normal skin (left hand side of 
figure) meets the benign tumour (right hand side of figure). D, E and F show the granular 
layer of an HPVl-induced wart. E4-positive cells do not express detectable levels of the 
differentiation-specific marker filaggrin, and show marked nuclear preservation when 
compared to neighbouring uninfected or non-permissive cells. Magnification is 200X. 
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The intracellular distribution of the HPV16 E4 proteins is distinct from the distribution of 
E4 in cutaneous lesiom caused by HPV1 and HPV63. 

The E1 A E4 protein of HP VI is predominantly cytoplasmic and assembles into 
inclusions that coalesce and increase in size as the cell migrates towards the surface of the 

5 skin. The E1 A E4 protein of HPV63 is found to have a fibrous and granular distribution. 
By contrast, HPV16 E4 had a filamentous and perinuclear distribution in cells of the 
lower epidermal layers (Fig, 7), and assembled into prominent structures only in the more 
differentiated cell layers. These 'inclusions' are always found singly per cell (c.f. multiple 
inclusions found in most cutaneous lesions), are located adjacent to the nucleus, and are 

10 nearly always detected on the side of the nucleus closest to the surface of the epidermis. 
Although similar in appearance to the E4/intermediate filament bundles which form after., 
expression of the HPV16 E1 A E4 protein in epithelial cells in vitro, we have not detected 
the presence of keratins in these structures in vivo. Antibodies to the very N-terminus of 
HPV16 E1 A E4 stained the structures much less readily than antibodies to C-tenninal 

15 epitopes (TVG 404, TVG405, TVG406) suggesting that the N-terminal region maybe 
either hidden or lost 

Figure 7 shows the association of the HPV16 E4 proteins with perinuclear bundles and 
filamentous structure in vivo, in particular the detection of HPV16 E4 proteins in the 

20 upper layers (Figs. 7A, B) and lower layers (Figs. 7C, D) of a HPV16 ON I using a 
mixture of Fabs TVG405 and TVG406. In the upper layers E4 is diffuse throughout the 
cytoplasm but with a prominent perinuclear pattern. Concentration of E4 into perinuclear 
bundles (arrowed in Fig. 7B) is apparent in these cells. In the lower layers, E4 has a 
predominantly perinuclear and filamentous appearance (Figs. 7C, D), but is not 

25 concentrated into perinuclear bundles. Magnification for Figs. 7A and C is 200X; that 
forBandDis400X. 

Confocal imaging revealed the N-terminal antibodies to localise primarily to the edge of 
the E4 structures while anti C-terminal staining is strongest in the centre (data not shown). 
30 When compared to the distribution seen with TVG405 and TVG406, the anti N-terminal 
reagent revealed HPV16 E1 A E4 to have a more diffuse distribution in the cell (Fig. 8). 
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No significant difference is apparent between the staining pattern of TVG405, 406, 407 
and the C-terminal antibody. 

Figure 8 provides evidence for processing of the HPV16 E4 proteins in vivo and shows 
5 triple staining in the upper layers of a HPV16 ON using HPV16 E4 Fab TVG406 which 
recognises an epitope in the C-terminal half of the E4 protein (Fig. 8A), an antibody to the 
N-terminal 12 amino acids of the HPV16 E1*E4 protein (Fig. 8B) and DAPI (Fig. 8C). 
TVG402, 403, 404, 405 and 407 produced staining patterns that are not significantly 
. different from that of TVG 406. And N-terminal antibodies did not effectively stain the 
10 perinuclear bundles (8B) which are evident with TVG406 (arrowed in 8A) suggesting that 
as in HPV1, different forms of the protein have different intracellular locations. 
Magnification is 400X. 

Example 6 

1 5 Detection ofHPV in cells isolated from cervical lesions 

Slides suitable for imaging of cells derived from cervical smears stained using 
anti-E4 antibodies are prepared by the method set forth in US 5,346,831. Cells are 
isolated from a patient according to conventional procedures and dissolved in 10ml 
alcohoVsaline buffer. The sample is prepared for centrifugation by disaggregating the 

20 clumps or clusters of cells in the sample vial by vortexing. After disaggregation, the 
sample is drained completely and layered over a density gradient in a 12 ml conical tube, 
wherein the density gradient is formed with a plasma expander material comprising 6% 
betastarch solution, and 0.9% physiological saline, also known by the tradename 
"Hespan" made by NPBI, Emmer-Compascuum, the Netherlands. 

25 

12 ml conical tubes containing density gradient and sample cells are placed into centrifuge 
buckets, balance and centrifuged for 5 minutes, at a force of about 600G. The liquid is 
then aspirated down to the 5 ml mark on the conical tube. The centrifuge buckets are 
removed and the 12 ml conical tube centrifuged with remaining liquid for 10 minutes, at 
30 800G. The tubes are emptied of supernatant, tapping lightly 2 or 3 times at a 45 degree 
angle. The tubes now contain packed cells of varying volumes. Upon mixing to 
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homogeneity, the pellets generally contain the same concentration of cells per unit 
volume of liquid. 

50ul of deionised H 2 0 is added, and the sample mixed by syringing 5 times through a 
5 0.042 inch tip. Upon completion of mixing, 150 ul of sample followed by 500 ul of 
deionised H 2 0 is dispensed into a sedimentation vessel attached to a slide which has been 
conventionally coated with Poly-L lysine (1% Sigma). The transferred sample is allowed 
to settle within the vessel for approximately 10 minutes. The excess sample is aspirated 
off and the chamber rinsed with 2 ml deionised H 2 0 two times (aspirating between each 
10 addition). 

FITC-labelled Fabs are then applied to the cells according to known procedures and the 
binding visualised by fluorescence microscopy. 

15 Example 7 

Double staining of sections with anti-MCMand anti-HPV antibodies 

•1 

Protocols 

Antigen exposure by microwaving followed by fluorescent immunostaining 
20 Paraffin embedded sections were de-waxed in xylene (4x5 min), hydrated in MS 

(3x5 min) and rinsed in de-ionised water for 2 x 5 min. Sections were microwaved on 
high power (800W) for 15 min in 500ml antigen retrieval buffer (10mM c itric acid, pH 
6.0), and allowed to cool in the buffer for 20 min. After washing twice for 5 min in PBS, 
slides 'were blocked in PBS, 10% serum, 1% BSA (v/v) for 2 hours at room temperature. 

25 Primary antibodies were diluted in PBS, 1% B SA, applied to the sections and incubated 
overnight at 4°C in a humidified chamber. Stringent washes in PBS, 0.05% Tween 20 
were carried out (4x5 min) followed by a rinse in PBS. Biotin labelled secondary 
antibodies, HPV 16E4 (Alexa 488-conjugated) and a nuclear counterstain, diluted in 1% 
BSA/PBS, were incubated on the section for 2 hours at room temperature. FoLJowing 

30 washes in PBS (3x 5min), the biotin signal was amplified with an SABC-AP kit (DAKO), 
followed by 3 x 5min PBS washes. The Alkaline phosphatase signals were detected by a 
10 min incubation with Fast Red. The reaction was halted after 10 min by rinsing in 
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water. The samples were mounted in citifluor mounting medium (Agar Scientific) and 
viewed under fluorescent light with a Nikon Labophot II microscope. Digital images were 
captured with a SenSys monochrome camera and IPLab imaging software (Roper 
Scientific ). 

5 

Antigen exposure by pressure cooking followed by enzymatic detection. 

Slides were de-waxed, pressure cooked and blocked as above. Slides were 
incubated overnight at 4°C with DIG labelled HPV 16 E4 (TVG405) and a mouse 
monoclonal MCM2 antibody, diluted 1:50 and 1:15 respectively in TBS, 1%BSA (w/v), 
10 0.1% triton (v/v). After washing in TBS the slides were incubated with anuDlG-AP 
secondary antibody diluted 1:50 and anti-mouse biotin diluted 1:200 in TBS containing 
1% BSA, for 1 hour at room temperature. After 3 x 5min washes in TBS the MCM signal 
was amplified with an SABC-HRP kit (DAKO), followed by 3 x 5min TBS washes. 
Signals were detected with DAB (diminobenzidine) and Fast Red. The reaction was 
15 halted by rinsing in water and a light haemotoxylin counter-stain was applied. 
Alternatively the E4 was detected with anti-DIG-HRP and DAB, and the MCM was 
amplified with an SABC-AP kit and Fast Red detection. Sections were dehydrated 
through graded alcohols and mounted in DPX mounting medium. Brightfield images were 
captured with Nikon Eclipse 600 microscope with a Nikon Lucia imaging camera and 
20 associated software. 

Double staining is carried out in HSIL and LSIL as described; the results are shown in 
Figures 10 and 11. 

25 MCM staining is usually abundant in the surface layers of HSIL whereas expression of the 
E4 protein has a sporadic distribution. This is in agreement with the previous report by 
Williams et al. (PNAS 95, 1998) who showed that in HSIL, 95% of surface cells are 
positive for MCM staining. There has been no previous investigation of the E4/MCM 
colocalisation. 
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Although Williams et al reported that MCM staining was apparent in LSIL (53% of 
surface cells showing staining, PNAS 95 (1998) pl4935) , we have found that in regions 
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that are positive for E4, MCM is usually absent from the surface layers (see 5388, 10451, 
4165 and 2565). Some LSIL (e.g. see LSIL-low (2565)) which stain positive for E4 
appear to lack any surface MCM staining. In the examples shown, . E4 staining is 
widespread at the surface whereas MCM staining is absent. 

" From the data presented here, cervical scrapes taken from HSIL will show high levels of 
MCM staining but show only low levels of E4 expression. Moreover, the data suggest 
that E4 antibodies should allow detection of LSILs which can not be detected using 
antibodies to MCM proteins. 



When the data presented in HSIL E4/MCM and LSIL E4/MCM are combined an 
additional conclusion can be made. The relative numbers, of E4-positive and MCM- 
positive cells in the surface layers indicates the severity of disease. High numbers of 
MCM-positive cells and low numbers of E4-positive cells indicates the presence of HSIL. 
15 High levels of E4-positive cells and.low numbers of MCM-positive cells in the surface 
layers is typical of LSIL. This type of analysis may be done using a cell sorter following 
double staining using antibodies to E4 and MCM. 

Antibodies to MCM2, MCM5 and MCM7 gave essentially the same patterns for both 
20 HSIL and LSIL. 

Example 8 

Double staining of normal cervical epithelium 

25 Figure 12 shows double staining, performed as in Example 7, for normal cervical 
epitheUum. Neither E4 nor MCM are present in the surface layers of normal cervical 
epithelium, emphasising the value of these antibodies for the detection of cervical 
abnormalities. The expression pattern of both proteins changes in regions of neoplasia and 
both E4 or MCM can be found at the surface. 
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Example 9 

Double staining for E4/MCM in other HPV types 

Figure 13 shows double staining results obtained with HPV1, HPV2 and HPV 16 warts. 
The HPV E7 protein acts by binding to the cellular Rb gene product. In its 
unphosphorylated state, Rb normally prevents cell cycle progression by binding to the 
transcription factor E2F. During progression through Gl , the levels of cyclin D kinase rise 
in response to growth factors acting externally on the cell. This leads to phosphorylation 
of Rb and release of E2F. E2F stimulates the synthesis of the proteins necessary for S- 
phase entry (reviewed in Virology edited by Fields, B .N., Knipe, D.M., Howley, P.M., 
published by Lippincott-Raven, Philadelphia, USA). Proteins involved in cellular DNA 
replication, such as MCM, cdc6, PCNA and cyclin A are amongst these. E2F-induced 
gene products (see Harbour & Dean, Nat Cell Biol 2000 Apr; 2(4):E65- 7). 

; In normal differentiating epithelial cells, cell cycle progression does not occur as the cells 
are committed to terminal differentiation. Neither S-phase markers nor HPV proteins are 
expressed in the differentiating cells of normal cervical epithelium. This is apparent from 
the images presented in Example 8. 

D In warts caused by HPV 1 and 2, and in E4-expressing LSIL caused by HPV 16, MCM is 
present in the lower layers of the epidermis. Based on our knowledge of HPV protein 
function this is predicted to be a result of E7 expression. The HPV E7 protein binds to Rb 
and displaces E2F which ultimately leads to entry into S-phase. Thus cellular genes which 
are activated by E7 can be considered as surrogate markers for the presence of the early 
25 viral genes. These genes are expressed from the p97 "early promoter in HPV16. In 
productive infections caused by HPV1, HPV2 and HPV16, the MCM proteins do not 
reach the surface of the epithelium. We predict that this is a common pattern which is 
likely to extend to all HPV types. 

30 Vegetative viral DNA replication coincides with the first appearance of E4 (Doorbar et 
al„ Virology 238 p40-52). E4 is expressed from the major late promoter, which in HPV16 
is at P 671 . Thus the appearance of E4 marks the onset of the late stage of the virus life 
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cycle. In lesions caused by HPV1, HPV2 and HPV16, the presence of E4 persists into the 
surface layers. Our recent work has shown that E4 can arrest the cell cycle in G2. In all 
three HPV types studied there is an overlap between the first appearance of E4 and the 
loss of staining for MCM. This observation leads us to suggest that these double positive 
cells are being pushed into S-phase by E7 but are unable to enter mitosis as a result of E4 
expression. Such a situation is likely to be conducive for high level replication of the viral 
genome. This theory is compatible with all the experimental data gathered to date. 

Thus we can summarise by stating that there are four types of cell: 

1. The presence of MCM-positive cells at the surface can be regarded as a sign that 
productive infection is not being completed and that the virus is producing an abortive 
infection. During productive infection, MCM+/E4- cells are found only, in the lower, 
layers. Such cells are abundant in the surface layers only in HSIL and cancers. 

2. MCM+/E4+ double positive cells are found in the intermediate layers in normal 
productive infections (i.e. in the MCM/E4 overlap region). Identification of these cells at 
the surface may indicate a lower grade lesion. 

20 3. Cells that are +ve for E4 only (MCM-/E4+ ) and which lack S-phase markers altogether 
cannot be supporting vegetative viral replication as the proteins necessary for viral 
replication are not present. The presence of these cells at the surface is characteristic of 
lesions which have initiated productive infection. Productive infections can also be 
categorised as LSEL. 



15 



25 



4. LI expression follows E4. The presence of L1+/E4+/MCM- cells in a smear could be 
used to highlight a lower grade of lesion (HPV I) in which the life cycle is being 
completed/This has not been investigated. 
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Lesions caused by HPV 1 and HPV2 which do not progress to cancer, do not show the 
great variations. All cells at the surface are MCM-/E4+ and some express both E4 and 
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L1+/E4+/MCM-. Cells at the surface showing MCM+/E4+ and MCM+/E4- phenotypes 
would suggest the presence of an abortive infection such as HSU. 

These patterns are only seen in lesions such as those caused by HPV 16, which are 
progressing towards cancer. Lesions which are supporting the full HPV life cycle such as 
those caused by HPV1 and HPV2 have only LB-/E4+/MCM- and E4+/MCM- cells at their 
surface. 



Example 9 

1 0 Other E2F-induced proteins can be substituted for MCM 

Most cervical abnormalities result from infection with a high risk papillomavirus. E7 is 
the viral protein which mediates cell proliferation. Surrogate markers of E7 expression 
such as MCM are present in all cervical neoplasias, but are confined to the lower 

15 epithelial layers in productive infections. Although MCM appear good markers of S-phase 
entry, there is no obvious reason why other cellular proteins induced by E2F could not be 
used in the same way. PCNA is essential for viral DNA replication and is also regulated 
by E2F (Hingorani & O'Donnell, Curr Biol 2000 Jan 13;10(l):R25-9). PCNA has been 
widely used as a marker of proliferating cells. Antibodies to Ki67 have also been used for 

20 this purpose. 

The images shown in Figure 14 demonstrate that double stains using antibodies to PCNA 
and E4 are broadly similar to those produced using antibodies to E4 and MCM. During 
productive infection, PCNA is expressed in the lower epithelial layers but overlaps the 
25 region in which E4 is found. PCNA is present in the surface layers in HSIL. 

We have also looked at the distribution of cyclin A which is also induced by E2F. Cyclin 
A is a kinase subunit that is essential for progression through S-phase. In HSIL, cyclin A 
extends into the surface layers. Double stains using antibodies to E4 and cyclin A reveal a 
30 region of overlap between the sites of expression of the two proteins. In most low grade 
lesions, cyclin A is confined to the lower epithelial layers whilst E4 is abundant at the 
surface. 
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Proteins such as PCNA and cyclin A thus have a similar overall distribution to MCM in 
both LSEL and HSEL. PCNA/E4 staining or cyclin A/E4 staining may thus be used for 
diagnosis in the same way as has been shown here for E4 and MCM. Although a 
comparison of the sensitivity of these approaches has not been carried out, PCNA appears 
to detect S-phase cells with a sensitivity that is similar to that of MCM. Other E2F- 
induced genes could be assessed in the same way. 



Example 10 

Other markers of cell proliferation can be substituted for E2F induced genes 



Cyclin B is a marker of G2 (Ohi et al, Curr Opin Cell Biol 1999 Apr;l l(2):267-73). As 
shown in Figure 15, cyclin B staining extends into the surface layers in HSIL. In LSIL in 
which E4 is expressed, these proteins are confined to the lower epithelial layers. As with 
15 PCNA, cyclin A and MCM, the regions which show cyclin B staining overlap the regions 
in which E4 can be detected in LSIL. Cyclin B does not extend into the surface layers in 
such lesions. Other cellular proteins which are induced by the virus may be used in 
conjunction with E4. 
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pl6 is a cell cycle inhibitor of the INK family. This protein is normally upregulated in 
response to unscheduled entry into S-phase. P 16 binds to cychnD/cdk4 and cyclinD/cdk6 
and inhibits its function. Cyclin D complexes normally phosphorylate Rb allowing release 
of E2F and entry into S phase. This regulation is bypassed by the HPV E7 protein. E7 
binds directly to Rb and causes release of E2F irrespective of the presence of cyclinD. 
25 Thus in high risk HPV infections, pl6 is abundant (Sano et al., Am J Pathol 1998 
Dec;153(6):1741-8). Although pl6 is not a proliferation marker it appears to be a useful 
marker of HPV early gene activity. 

E4 staining may thus be combined with other markers of virus activity such as cyclin B 
30 and P 16. These markers may be useful when used in conjunction with E4 for the 
diagnosis of cervical neoplasia. 
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All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system of 
the invention will be apparent to those skilled in the art without departing from the scope 
and spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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CLAIMS 

1. A method for detecting abnormalities in a sample from a patient, said method 
comprising: 

5 (a) obtaining a sample of the patient's cells; 

(b) contacting said cells with two or more molecule(s), wherein at least one molecule 
is capable of binding a papilloma vifus associated nucleic acid or antigen, and at least one 
molecule is capable of binding a cell proliferation marker or a viral activity marker, and 

(c) monitoring said binding. 

10 

2. A method according to claim 1 wherein the sample comprises cervical cells. 

3. A method according to any preceding claim wherein the sample comprises cells 
collected from a cervical smear. 

15 

4. A method according to any preceding claim wherein the cells are contacted with 
the two or more molecules simultaneously. 

5. A method according to any preceding claim wherein the papilloma virus is human 
20 papillomavirus. 

6. A method according to any preceding claim wherein the human papilloma virus 
comprises one or more types selected from the group consisting of HPV types 16, 18, 33, 
35, 45, 51, 52, 56, 58 and 61. 

25 

7. A method according to any preceding claim wherein at least one molecule is a 
molecule that binds specifically to a subset of HPV E4 proteins. 

8. A method according to any preceding claim wherein at least one molecule is a 
30 molecule capable of binding to the papilloma virus E4 protein, and is capable of binding 

within a hydrophilic region of the E4 sequence. 



m • 

(^B ^^CT/GBOl/01176 
WO 02/08764 ^gf 

61 

9 A method according to any preceding claim wherein the cell proliferation marker 
comprises one or more polypeptides which are members of a preinitiation complex of 
DNA replication. 

5 10. A method according to any preceding claim, wherein at least one molecule is 
capable of binding a papilloma virus associated nucleic acid or antigen, and at least one 
molecule is capable of binding a viral activity marker. 

11. A method according to claim 10, wherein the viral activity marker is pi 6 or cyclin 
10 B. 

12. A method according to any one of claims 1-9, wherein at least one molecule is 
capable of binding a papilloma virus associated nucleic acid or antigen, and at least one 
molecule is capable of binding a cell proliferation marker. 

15 

13 A method according to claim 12 wherein the cell proliferation marker compnses 
one or more polypeptides selected from the group consisting of CDC6, MCM2, MCM3, 
MCM4, MCM5, MCM6, MCM7, Cdc7 protein kinase, Dbf4, CdcH protein phosphatase, 
Cdc45, PCNA, Ki67, KiSl, cyclin A and MCM10. 

20 

14. A method according to claim 12, wherein the cell proliferation marker is not an 
MCM polypeptide. 

15. A method according to claim 13 or claim 14, wherein the cell proliferation marker 
25 is selected from the group consisting of cyclin A, Ki67 and PCNA. 

16. A method according to any preceding claim wherein the HPV or marker of cell 
proliferation or viral activity, or both, are detected at the mKNA stage. 

30 17. A method according to any preceding claim, wherein the molecule capable of 
binding a papilloma virus associated nucleic acid or antigen, or capable of binding a 
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marker of cell proliferation or viral activity, is an antibody or an antigen-binding fragment 
thereof. 

18. A method according to any preceding claim, wherein the cell proliferation marker 
5 or marker of viral activity is induced by E6 and/or E7. 

19. A method for detecting abnormalities in a sample from a patient, said method 
comprising: 

(a) obtaining a sample of the patient' s cells; 

1 0 (b) contacting said cells with two or more molecule(s), wherein at least one molecule 
is capable of binding one papilloma virus associated antigen or a plurality of papilloma 
virus associated antigens, and at least one molecule is capable of binding one cell 
proliferation or viral activity marker, or a plurality of cell proliferation or viral activity 
markers; and 

15 (c) monitoring said binding. 

20. A kit comprising two or more molecule(s), wherein at least one molecule is 
capable of binding a papilloma virus associated antigen, and at least one molecule is 
capable of binding a marker of cell proliferation or viral activity, and instructions for use 

20 of said molecules in a method according to any preceding claim. 

21. A molecule is capable of binding a papilloma virus associated nucleic acid or 
antigen, and a molecule is capable of binding a cell proliferation marker, for 
simultaneous, simultaneous separate or sequential use in the detection of abnormalities in 

25 a sample. 

22. A method for assessing the risk associated with cellular abnormality in a patient 
sample, comprising obtaining a sample of the patient's cells; contacting said cells with 

' two or more molecule(s), wherein at least one molecule is capable of binding a papilloma 
30 virus associated nucleic acid or antigen, and at least one molecule is capable of binding a 
cell proliferation marker or a plurality of cell proliferation markers and/or marker(s) of 
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viral activity; and categorising the risk according to (a) the presence of cell proliferation 
markers, and (b) the detection of high or lowriskHPV virus infection. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MEDICAL RESEARCH COUNCIL 

(B) STREET: 20 PARK CRESCENT 

(C) CITY: LONDON 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP): WIN 4AL 

(ii) TITLE OF INVENTION: IMPROVEMENTS IN OR RELATING TO SCRE 
FOR PAPILLOMA VIRUSES 

(iii) NUMBER OF SEQUENCES: 3 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(ix) FEATURE: 



(A) NAME /KEY: CDS 

(B) LOCATION : 1 . .375 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GCG CTG CCA CTC TCA GAA GTT ATT GTC AAA AAC TTG CAA CTT GCT TTG 
48 

Ala Leu Pro Leu Ser Glu Val He Val Lys Asn Leu Gin Leu Ala Leu 
1 5 10 15 

GCA AAT AGC TCT CGA AAT GCT GTC GCT CTT TCT GCC AGC CCT CAA CTG 
96 

Ala Asn Ser Ser Arg Asn Ala Val Ala Leu Ser Ala Ser Pro Gin Leu 

20 25 30 

AAA GAG GCC CAG TCA GAG AAG GAA GAA GCC CCA AAG CCA CTT CAC AAA 
144 

Lys Glu Ala Gin Ser Glu Lys Glu Glu Ala Pro Lys Pro Leu His Lys 
35 40 ' 45 

GTA GTG GTA TGT GTT AGT AAA AAA CTC AGT AAG AAG CAG AGT GAA CTA 
192 

Val Val Val Cys Val Ser Lys Lys Leu Ser Lys Lys Gin Ser Glu Leu 
50 55 60 

AAT GGG ATC GCA GCC TCT CTA GGA GCA GAT TAC AGG TGG AGT TTT GAT 
240 

Asn Gly He Ala Ala Ser Leu Gly Ala Asp Tyr Arg Trp Ser Phe Asp 
65 70 75 80 

GAA ACA GTG ACT CAT TTC ATC TAT CAA GGG CGG CCA AAT GAC ACT AAT 
288 

Glu Thr Val Thr His Phe He Tyr Gin Gly Arg Pro Asn Asp Thr Asn 
85 90 « 

CGG GAG TAT AAA TCT GTA AAA . GAA AGA GGA GTA CAC ATT GTT TCC GAG 
336 

Arg Glu Tyr Lys Ser Val Lys Glu Arg Gly Val His He Val Ser Glu 
100 105 HO 

CAC TGG CTT TTA GAT TGT GCC CAA GAG TGT AAA CAT CTT 
375 

His Trp Leu Leu Asp Cys Ala Gin Glu Cys Lys His Leu 

115 120 125 



2 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Ala Leu Pro Leu Ser Glu Val lie Val Lys Asn Leu Gin Leu Ala Leu 
! 5 10 15 . 

Ala Asn Ser Ser Arg Asn Ala Val Ala Leu Ser Ala Ser Pro Gin Leu 
20 25 30 

Lys Glu Ala Gin Ser Glu Lys Glu Glu Ala Pro Lys Pro Leu His Lys 
35 40 45 

Val Ser Lys Lys Leu Ser Lys Lys Gin Ser Glu Leu 



Val Val Val Cys 
50 



55 60 



Asn Gly He Ala Ala Ser Leu Gly Ala Asp Tyr Arg Tr P Ser Phe Asp 

n c 80 

65 ™ 75 

Glu Thr Val Thr His Phe He Tyr Gin Gly Arg Pro Asn Asp Thr Asn 



85 



90 95 



Arg Glu Tyr Lys Ser Val Lys Glu Arg Gly Val His He Val Ser Glu 
100 105 HO 

His Trp Leu Leu Asp Cys Ala Gin Glu Cys Lys His Leu 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



i 
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(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Pro Glu Glu His Asp Ser Pro Thr Glu Ala Ser Gin Pro lie Val 



10 I 5 



Glu Glu Glu Glu Thr Lys Thr Phe Lys Asp Leu Gly Val Thr Asp Val 
20 25 30 

Leu Cys Glu Ala Cys Asp Gin Leu Gly Trp Thr Lys Pro Thr Lys He 



35 40 45 

Gin lie Glu Ala Tyr Ser Leu Ala Leu Gin Gly Arg Asp He lie Gly 
50 55 60 

Glu Thr Gly Ser Gly Lys Thr Gly Ala Phe Ala Leu Pro lie 



Leu Ala 80 
65 



70 " 



Leu Asn Ala Leu Leu Glu Thr Pro Gin Arg Leu Phe Ala Leu Val Leu 
85 90 « 

Thr Pro Thr Arg Ser Trp Pro Phe Arg Ser Gin Ser Ser Leu Lys Pro 



100 



105 



110 



Trp Ser Ser lie Gly Val Gin Ser Ala Val He Val Gly Gly He Asp 
115 . 120 125 

Ser Met Ser Gin Ser Leu Ala Leu Ala Lys Lys Pro His He lie He 
130 135 140 



Thr Pro Gly Arg Leu He Asp His Leu Glu Asn Thr Lys Gly Phe 
145 150 155 160 



Ala 



Asn Leu Arg Ala Leu Lys Tyr Leu Val Met Asp Glu Ala Asp Arg He 
165 "0 i75 
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Leu Asn Met Asp Phe 
180 



Glu Thr Glu Val Asp Lys He Leu Lys Val He 



185 



190 



Pro Arg Asp Arg Lys Thr Phe Leu Phe Ser Ala Thr Met Thr Lys Lys 

200 205 



195 



Val Gin Lys Leu 
210 



Gin Arg Ala Ala Leu Lys Asn Pro Val Lys Cys Ala 



215 



220 



Val Ser Ser Lys Tyr Gin Thr Val Glu Lys Leu Gin Gin Tyr Tyr lie 



225 



230 



235 



240 



Phe He Pro Ser Lys Phe Lys Asp Thr Tyr Leu Val Tyr lie Leu Asn 



Glu Leu Ala Gly 
260 

Thr Gin Arg Thr 
275 



245 250 255 

Asn Ser Phe Met He Phe Cys Ser Thr Cys. Asn Asn 



265 



270 



Ala Leu Leu Leu Arg Asn Leu Gly Phe Thr Ala lie 



280 



285 



Pro Leu His Gly Gin Met Ser Lys Arg Leu Gly Ser Leu Asn Lys Phe 



290 



295 



300 



Lys Ala Lys Ala Arg Ser lie Leu Leu Ala Thr Asp Val Ala Ser Arg 



305 



Gly Leu Asp 



Thr His Ser 



Gly Arg Ser 
355 



• 310 



315 



lie Pro His Val Asp Val Val Val Asn Phe Asp lie Pro 



335 



325 330 
Lys Asp Tyr He His Arg Val Gly Arg Thr Ala Arg Ala 



350 



340 345 

Gly Lys Ala He Thr Phe Val Thr Gin Tyr Asp Val Glu 



360 



365 



Leu Phe Gin Arg He Glu His Leu He Gly Lys Lys Leu Pro Gly Phe 

375 380 



370 



Pro Thr Gin Asp Asp 
385 



Glu Val Met Met Leu Thr Glu Arg Val Ala Glu 



390 



395 



400 



5 
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m 

W a Gin Arg Phe Ala Arg Met Glu Leu Arg Glu His Oly Glu ^ Lys 
405 "0 

. Lys Arg Ser Arg Glu Asp Ala Gly Asp Asn Asp Asp Thr Arg Gly Cys 
420 425 

435 44 ° 445 

teg Pro Leu He Thr Phe Met Lys Ala Arg Val Leu Leu Phe Cys Lys 
450 

^ G lu Leu Glu As, Glu Thr Cys Ser Asn Arg Asp His Glu Thr Glu 



465 



470 



He Gly Gin Asn Cys Val Gin Asn Val Leu Ser 



485 490 
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